I was reading a discussion on a social media site about Application Performance Management, and realized that there is a lot of confusion about what Application Performance Monitoring, Application Performance Management (APM) and IT Operational Analytics (ITOA) actually mean.
Taken at face value, Application Performance Monitoring suggests watching data and monitoring it for a particular condition or state. Application Performance Management suggests a wider field, which includes a range of techniques to monitor the application but also to manage other aspects of the IT estate. The degree to which complex analytics are used is unclear. Potentially, IT Operational Analytics could be seen as a subset of Application Performance Management, although the focus on the application might make APM more limited in scope than ITOA.
To help clarify this rather muddy set of terms, we use two models which we find clearer, more logical and less ambiguous than the APM and ITOA definitions.
The Monitoring Maturity Model
The first model we call the Monitoring Maturity Model, because it is a layered model in which the higher levels are generally based on data collected from the lower levels. The model is:
1. Infrastructure Monitoring: Collecting data on the servers, operating systems, network and storage, and setting rule-based alerts to catch potential problems.
2. Basic Application Monitoring: By interrogating the operating system, capturing and alerting on data about the processes running on the servers. This would include CPU and memory utilization, disk I/O, network I/O, etc.
3. Advanced Application Monitoring: Installing a tailored agent on the server which captures data specific to the application it is monitoring. This can be "inside the app" data or "outside the app" data, the latter being useful for off-the-shelf software products and middleware.
4. Flow Monitoring: Capturing data about the information passing between applications, and monitoring and reporting on data flows. This would include volumes per second, volumes per counterparty, latency, etc.
5. Business and IT Analysis: This is the analysis of both business data and "machine" data from levels 1 and 2 to understand the business activity and the behavior of the IT estate.
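To make level 4 more concrete, here is a minimal Python sketch of the flow metrics mentioned above: volume per second, volume per counterparty and latency. The Message record, its field names and the flow_summary function are hypothetical, invented for illustration only. A real flow monitoring tool would capture this data from the messaging layer itself.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Message:
    """Hypothetical record of one message passing between two applications."""
    counterparty: str   # who the message was exchanged with (assumed field)
    sent_at: float      # send time in seconds (assumed field)
    received_at: float  # receive time in seconds (assumed field)


def flow_summary(messages, window_seconds):
    """Summarize one observation window of messages into basic flow metrics."""
    per_counterparty = defaultdict(int)
    latencies = []
    for m in messages:
        per_counterparty[m.counterparty] += 1
        latencies.append(m.received_at - m.sent_at)
    return {
        "messages_per_second": len(messages) / window_seconds,
        "per_counterparty": dict(per_counterparty),
        "mean_latency": sum(latencies) / len(latencies) if latencies else 0.0,
        "max_latency": max(latencies, default=0.0),
    }


# Illustrative data only: three messages seen in a 10-second window.
sample = [
    Message("BankA", 0.0, 0.25),
    Message("BankB", 1.0, 1.5),
    Message("BankA", 2.0, 2.75),
]
summary = flow_summary(sample, window_seconds=10.0)
print(summary)
```

Metrics like these could then feed the rule-based alerts of the lower levels or the analysis at level 5.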
Monitoring vs Analytics
The second model separates monitoring from analytics. There is no hard definition that separates them, so we break analysis into three types:
1. Detect: Rule-based detection of an alert condition. This is generally what people mean when they talk about monitoring.
2. Analyze: Collecting lots of data, including data which did not trigger a rule in Detect, and analyzing it to gain more insight. This may be as simple as trends, or as complex as machine learning and time-series pattern-based anomaly detection. It would also include techniques like Bayesian network causal analysis.
3. Predict: Using current and historic data to try to predict future or “what if” scenarios. Again, this can be as simple as extrapolation, or as complex as comparing the current state to empirically derived behavioral data, the likes of which you might have gathered in a performance lab when stress testing an application.
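The difference between the three types can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not a description of any product: the function names, the CPU figures, the 90% threshold and the z-score limit are all invented for the example. Detect is a fixed rule, Analyze is a basic z-score check against history (standing in for the more complex anomaly detection techniques), and Predict is a straight-line extrapolation fitted by least squares.

```python
from statistics import mean, stdev


def detect(value, threshold):
    """Detect: a fixed rule fires when the value crosses a threshold."""
    return value > threshold


def analyze(history, value, z_limit=3.0):
    """Analyze: flag a value that is unusual relative to history (z-score)."""
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        return value != mu
    return abs(value - mu) / sigma > z_limit


def predict(history, steps_ahead):
    """Predict: extrapolate a least-squares straight line through history."""
    n = len(history)
    xs = range(n)
    x_bar = mean(xs)
    y_bar = mean(history)
    slope = (
        sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, history))
        / sum((x - x_bar) ** 2 for x in xs)
    )
    intercept = y_bar - slope * x_bar
    return intercept + slope * (n - 1 + steps_ahead)


# Illustrative CPU utilization samples (percent), one per interval.
cpu = [40.0, 42.0, 44.0, 46.0, 48.0]

print(detect(cpu[-1], threshold=90.0))  # the rule does not fire at 48%
print(analyze(cpu, 95.0))               # 95% is far outside the history
print(predict(cpu, steps_ahead=3))      # straight-line forecast
```

Note that the steadily rising series never triggers the Detect rule, yet the Predict step shows where the trend is heading. That is the kind of insight which rule-based monitoring alone does not provide.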
Whichever way you model your IT estate and the behavior of your applications, it is necessary to have a clear language so that people are talking about the same thing.
Guy Warren is CEO of ITRS Group.