APMdigest asked the top minds in the industry what they think AIOps can do for IT Operations. Part 4 covers root cause analysis and automation.
Start with What Can AIOps Do For IT Ops? - Part 1
Start with What Can AIOps Do For IT Ops? - Part 2
Start with What Can AIOps Do For IT Ops? - Part 3
SINGLE PANE OF GLASS
AIOps provides a much needed real-time "single-pane-of-glass" view into complex IT infrastructures that encompass fragmented and distributed multi-vendor, multi-domain technologies including legacy, virtualization, hybrid cloud, containers, microservices, and others. Although AIOps is a seismic change for IT operations, it's not a radical application of analytics and machine learning. The potential of AIOps is enormous. Enterprises that have deployed AIOps solutions are experiencing transformational benefits in revenue growth, better customer retention, improved customer experience, lower costs, and enhanced performance. The time to move is now.
Maruti Sivakumar V
SVP, Head of Digital & Practices, Blue.cloud
ISOLATING THE ROOT CAUSE
AIOps helps build high-quality incidents that include all the necessary technical and business context, alongside AI/ML-identified probable root cause and root cause changes — and present it all within a single pane of glass.
Mohan Kompella, VP Product Marketing,
Adam Blau, Director of Product Marketing,
Anirban Chatterjee, Director of Product Marketing, BigPanda
AIOps is a buzzword 6 different types of products designed to create value for IT Operations professionals. Always pick specific use cases you wish to solve and then understand how machine learning and AI can apply to solve that issue or set of issues. Good examples of this are to help the user isolate the root cause down to a specific component, highlight outliers in graphs and other views, correlate likely related data types together. Generally, these technologies help augment the operator of the software versus being automation magic. Most often these are features in other Observability tools versus AIOps platforms. AIOps platforms are fantasy because the semantic meaning of data is not clear. The result is vendors write rules to analyze the data, making the resulted outcomes only work in specific situations which makes them useless when a major problem happens across a set of complex systems.
Jonah Kowall
CTO, Logz.io
AUTOMATED ROOT CAUSE ANALYSIS
Response automation is one of the most value-driving features of AIOps software tools. IT operators are able to conduct performance tests to establish a baseline for each metric or KPI and define acceptable thresholds for the ones they want to prioritize. When a KPI breach is detected, AIOps software can perform an automated root cause analysis to automatically determine why a problem occurred and implement a solution if one is available.
Abel Gonzalez
Director of Product Marketing, Sumo Logic
Machine learning and AI are not just critical — but foundational — components of a dynamic monitoring platform. Modern applications are constantly in flux, and microservices scale through ephemeral cloud and container infrastructure in response to demand. As these systems become more complex and dynamic, operational tasks consume an increasing share of engineering time. AIOps optimizes and automates IT operations so that engineers can get proactively alerted no matter the size of the workloads, and benefit from an augmented troubleshooting experience by cutting through noise to glean key insights. In some cases, AI can auto-discover the root cause of an issue, saving minutes or hours of stressful investigations. This is the core advantage of effective AIOps — less engineering time wasted on managing complex operations, and more time building new products for customers.
Renaud Boutet
VP of Product, Datadog
BETTER DECISION-MAKING
From a monitoring and observability perspective, a key benefit of AIOps has been the ability to use historical data to increase confidence in decisions that we previously thought were black-and-white. It's relatively simple to have a machine check if a service is up or down, but how do we find the trends that show that whilst the website is up, it's gradually been getting slower over the past few months? Modern tooling allows us to collect enough data and process it fast enough — often in real-time — for the machines to be able to make better-informed decisions, faster. Such decisions could only be made by lengthy human inspection previously. It's a great example of modern tooling working in the background to make sure everything is okay, so we don't have to.
Matt Saunders
Head of DevOps, Adaptavist
AIOps observability can play a critical role in terms of expected trends using the data from users, systems and processes and provide the data back to the decision-makers to make the investment call based on the pattern, trends, etc. With growing Cloud demand, it is imperative the enterprises start investing in AIOps before it is too late.
Vishnu Vasudevan
Head of Product Engineering and Management, Opsera
SYNCING WITH ITSM
Create automated, bi-directional syncing with your ITSM platform, on-call or other collaboration tools and reduce ticket/notification volumes by up to 95%
Mohan Kompella, VP Product Marketing,
Adam Blau, Director of Product Marketing,
Anirban Chatterjee, Director of Product Marketing, BigPanda
First generation AIOps solutions are a step in the right direction, to address the unending IT complexity, but needed more care and feed and only solved limited set of problems for ITOps teams. Looking ahead, new age AIOps platforms are poised to make AIOps faster, better and cheaper — by automating data preparations and integrations, by having native asset/topology intelligence and by using expanded AI/ML frameworks like neural networks, NLP, transformer models and graph databases to address a lot more use cases. This paves a path where everybody in the IT benefits — ITSM, Service Desk, IT Asset/Planning and more.
Tejo Prayaga
Product Management, CloudFabrix
UNDERSTANDING ALGORITHMS
The last several years have seen a dramatic increase in the use of AI across all types of companies and platforms. These complex solutions require more parts of an organization to be knowledgeable of AI, from data pipelines to the workflows that build, qualify and optimize the models. Having a specialized Ops function that understands this end-to-end is going to be critical for maximizing AI's effectiveness in a production environment. Over time, AIOps can build a deeper understanding of the algorithms, then use that knowledge to enhance the infrastructure with automated services around data cleaning, model tuning and scaling that will continue delivering key results for the business. This kind of specialty is beyond what a traditional IT Operations team can do with the breadth that they are normally expected to maintain.
David Luks
VP of Engineering, Smart Applications, Lucidworks
AUTOMATION
AIOps delivers significant value to businesses by automating many of the manual, tedious tasks that distract IT from working on higher level projects, especially when it comes to data prep.
David P. Mariani
CTO and Founder, AtScale
As the cadence of business continues to gain momentum and competition builds, organizations must not only innovate but also identify business problems and inefficiencies and utilize technology to overcome them. AIOps acts as the salve for many enterprise challenges by anchoring a triangulation of machine learning, decision automation and advanced analytics to automate repetitive tasks, freeing IT teams to work on new mission critical and challenging problems — resulting in faster completion of projects and improved business outcomes.
Alan Young
CPO, InRule
REMEDIAL OPTIMIZATION
IT Operations cannot keep up with the requirements of keeping cloud applications functional and running their best. IT Ops needs to utilize the power of AI to keep the many combinations of app parameters and metrics in an optimal state. Moreso, for AIOps to keep operational apps optimized it needs to be continuous (always on) and autonomous (no human intervention). This way AIOps can perform the remedial optimization work the IT Ops SREs would do, but much faster and with more accuracy.
Peter Nickolov
Co-Founder and VP of Engineering, Opsani
The Latest
Broad proliferation of cloud infrastructure combined with continued support for remote workers is driving increased complexity and visibility challenges for network operations teams, according to new research conducted by Dimensional Research and sponsored by Broadcom ...
New research from ServiceNow and ThoughtLab reveals that less than 30% of banks feel their transformation efforts are meeting evolving customer digital needs. Additionally, 52% say they must revamp their strategy to counter competition from outside the sector. Adapting to these challenges isn't just about staying competitive — it's about staying in business ...
Leaders in the financial services sector are bullish on AI, with 95% of business and IT decision makers saying that AI is a top C-Suite priority, and 96% of respondents believing it provides their business a competitive advantage, according to Riverbed's Global AI and Digital Experience Survey ...
SLOs have long been a staple for DevOps teams to monitor the health of their applications and infrastructure ... Now, as digital trends have shifted, more and more teams are looking to adapt this model for the mobile environment. This, however, is not without its challenges ...
Modernizing IT infrastructure has become essential for organizations striving to remain competitive. This modernization extends beyond merely upgrading hardware or software; it involves strategically leveraging new technologies like AI and cloud computing to enhance operational efficiency, increase data accessibility, and improve the end-user experience ...
AI sure grew fast in popularity, but are AI apps any good? ... If companies are going to keep integrating AI applications into their tech stack at the rate they are, then they need to be aware of AI's limitations. More importantly, they need to evolve their testing regiment ...
If you were lucky, you found out about the massive CrowdStrike/Microsoft outage last July by reading about it over coffee. Those less fortunate were awoken hours earlier by frantic calls from work ... Whether you were directly affected or not, there's an important lesson: all organizations should be conducting in-depth reviews of testing and change management ...
In MEAN TIME TO INSIGHT Episode 11, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses Secure Access Service Edge (SASE) ...
On average, only 48% of digital initiatives enterprise-wide meet or exceed their business outcome targets according to Gartner's annual global survey of CIOs and technology executives ...
Artificial intelligence (AI) is rapidly reshaping industries around the world. From optimizing business processes to unlocking new levels of innovation, AI is a critical driver of success for modern enterprises. As a result, business leaders — from DevOps engineers to CTOs — are under pressure to incorporate AI into their workflows to stay competitive. But the question isn't whether AI should be adopted — it's how ...