Observability

February 28, 2022

While the pandemic forced enterprises to uproot their traditional network infrastructures to better accommodate masses of people working from home or alternate locations, they are coming to grips with a new reality including many issuing decrees that many employees may never return to the office ...

February 10, 2022

Still not convinced on the value an AIOps platform offers? Consider this: one minute of downtime at Amazon costs the company roughly $220,000 in revenue. With that kind of money on the line, SRE and DevOps teams forced to manage availability by writing rules and querying logs manually are set up to fail — and failure is costly. AIOps is the necessary lift your monitoring tools need to improve performance and cut out the toil for DevOps and IT teams. Here are five ways AIOps does exactly that ...

February 07, 2022

Logs produced by your IT infrastructure contain hidden gems — information about performance, user behavior, and other data waiting to be discovered. Unlocking the value of the array of log data aggregated by organizations every day can be a gateway to uncovering all manner of efficiencies. Yet, the challenge of analyzing and managing the mountains of log data organizations have is growing more complex by the day ...

February 01, 2022

Our growing dependence on the cloud and Internet for business means we must take time to prepare for downtime and latency issues. There are valuable lessons found in most failures, and the Internet outages of 2021 certainly provide ample motivation to revamp processes for mitigating system disruptions. Here are six take-aways from 2021's Internet fails that can be used to increase efficiencies in managing the system infrastructure of any enterprise, no matter its size or sector ...

January 24, 2022

The shift to containers and microservices is a key component of the digital transformation and shift to an all encompassing digital experience that modern customers have grown to expect. But these seismic shifts have also presented a nearly impossible task for IT teams: achieve ceaseless innovation whilst maintaining an ever more complex infrastructure environment, one that tends to produce vast volumes of data. Oh and can you also ensure that these systems are continuously available? ...

January 11, 2022

Having a variety of tools to choose from creates challenges in telemetry data collection. Organizations find themselves managing multiple libraries for logging, metrics, and traces, with each vendor having its own APIs, SDKs, agents, and collectors. An open source, community-driven approach to observability will gain steam in 2022 to remove unnecessary complications by tapping into the latest advancements in observability practice ...

January 10, 2022

These are the trends that will set up your engineers and developers to deliver amazing software that powers amazing digital experiences that fuel your organization's growth in 2022 — and beyond ...

January 06, 2022

In a world where digital services have become a critical part of how we go about our daily lives, the risk of undergoing an outage has become even more significant. Outages can range in severity and impact companies of every size — while outages from larger companies in the social media space or a cloud provider tend to receive a lot of coverage, application downtime from even the most targeted companies can disrupt users' personal and business operations ...

January 05, 2022

Move fast and break things: A phrase that has been a rallying cry for many SREs and DevOps practitioners. After all, these teams are charged with delivering rapid and unceasing innovation to wow customers and keep pace with competitors. But today's society doesn't tolerate broken things (aka downtime). So, what if you can move fast and not break things? Or at least, move fast and rapidly identify or even predict broken things? It's high time to rethink the old rallying cry, and with AI and observability working in tandem, it's possible ...

December 13, 2021

Industry experts offer thoughtful, insightful, and often controversial predictions on how APM, AIOps, Observability, OpenTelemetry, and related technologies will evolve and impact business in 2022. Part 4 covers OpenTelemetry ...

December 09, 2021

Industry experts offer thoughtful, insightful, and often controversial predictions on how APM, AIOps, Observability, OpenTelemetry, and related technologies will evolve and impact business in 2022. Part 3 covers Observability ...

October 26, 2021

APMdigest asked the top minds in the industry what they think AIOps can do for IT Operations. Part 2 covers capabilities supported by AIOps, such as visibility and alerting ...

October 18, 2021

Distributed tracing has been growing in popularity as a primary tool for investigating performance issues in microservices systems. Our recent DevOps Pulse survey shows a 38% increase year-over-year in organizations' tracing use. Furthermore, 64% of those respondents who are not yet using tracing indicated plans to adopt it in the next two years ...

September 29, 2021

When a website or app fails or falters, the standard operating procedure is to assemble a sizable team to quickly "divide and conquer" to find a solution. The details of the problem can usually be found somewhere among millions of log events and metrics, leading to slow and painstaking searches that can take hours and often involve handoffs between experts in different areas of the software. The immediate goal in these situations is not to be comprehensive, but rather to troubleshoot until you find a solution that remedies the symptom, even if the underlying root cause is not addressed ...

September 14, 2021

While 90% of respondents believe observability is important and strategic to their business — and 94% believe it to be strategic to their role — just 26% noted mature observability practices within their business, according to the 2021 Observability Forecast ...

September 08, 2021

DevOps, SRE and other operations teams use observability solutions with AIOps to ingest and normalize data to get visibility into tech stacks from a centralized system, reduce noise and understand the data's context for quicker mean time to recovery (MTTR). With AI using these processes to produce actionable insights, teams are free to spend more time innovating and providing superior service assurance. Let's explore AI's role in ingestion and normalization, and then dive into correlation and deduplication too ...

September 07, 2021

As we look into the future direction of observability, we are paying attention to the rise of artificial intelligence, machine learning, security, and more. I asked top industry experts — DevOps Institute Ambassadors — to offer their predictions for the future of observability. The following are 10 predictions ...

August 26, 2021

After more than a year of accelerated innovation, and without the tools and insights to effectively manage technology performance, many IT departments are struggling. The Agents of Transformation 2021 research brought home exactly how hard the last 12 months have hit technologists ...

August 25, 2021

Recent AppDynamics research, Agents of Transformation 2021: The Rise of Full-Stack Observability, found that the speed of implementation for digital transformation programs in financial services has increased by three times over the past year, compared to pre-pandemic levels. This is particularly concerning since the financial services sector has historically led the way with digitization and been particularly innovative in the digital experiences it offers customers ...

August 05, 2021

The average "lifespan" of a company on the Fortune 500 list has dropped from 75 to 15 years, indicating that today, a business' longevity is less to do with industrial decline and leadership, and more influenced by technology and trends, suggesting businesses need to be more agile. As digital transformation continues to change business today, innovative technology like observability with AIOps will play a critical role in helping brands keep up. And as more and more brands implement this innovative technology, there are three main ways they'll see it transform their business ...

July 19, 2021

Anti-patterns involve realizing a problem and implementing a non-optimal solution that is broadly embraced as the go-to method for solving that problem. This solution sounds good in theory, but for one reason or another it is not the best means of solving the problem. Anti-patterns are common across IT as well, especially around application monitoring and observability. One that is particularly prevalent is in response to the increasing complexity of cloud-native infrastructure and applications ...

July 14, 2021

SREs that fail to deliver customer value run the risk of being stuck in an operational toil rut. Conversely, businesses failing to recognize the bi-modal nature and importance of SRE activities run the risk of losing talented employees and their competitive edge ...

July 12, 2021

When you see distressing internet outages occur like the recent Fastly incident that threw a slew of websites offline, I am never surprised by how widespread the problem was, but paradoxically that it wasn't worse ...

June 29, 2021

With every organization now being a digital organization, observability should be viewed as a core competency, not a cutting-edge differentiator, according to The State of Observability 2021, a report from Splunk ...

June 23, 2021

More than half (61%) of respondents reported that their teams are practicing observability, an 8% increase from 2020, signaling that overall adoption is on the rise, according to a 2021 survey from Honeycomb. However, the majority of respondents indicated their teams are at the earliest stages of observability maturity ...

Pages