The Anatomy of APM – 4 Foundational Elements to a Successful Strategy
April 04, 2012

Larry Dragich
Auto Club Group

By embracing End-User-Experience (EUE) measurements as a key vehicle for demonstrating productivity, you build trust with your constituents in a very tangible way. The translation of IT metrics into business meaning (value) is what APM is all about.

The goal here is to simplify a complicated technology space by walking through a high-level view of each core element. I’m suggesting that the success factors in APM adoption center on the EUE and the integration touch points with the Incident Management process.

Looking at APM from 20,000 feet, four foundational elements come into view:

- Top Down Monitoring (RUM)
- Bottom Up Monitoring (Infrastructure)
- Incident Management Process (ITIL)
- Reporting (Metrics)

Top Down Monitoring

Top Down Monitoring, also referred to as Real-time Application Monitoring, focuses on the End-User-Experience. It has two components, Passive and Active. Passive monitoring is usually an agentless appliance that leverages network port mirroring. This low-risk implementation provides one of the highest values within APM in terms of application visibility for the business.

Active monitoring, on the other hand, consists of synthetic probes and web robots that report on system availability and predefined business transactions. It is a good complement to passive monitoring because it provides visibility into application health during off-peak hours, when transaction volume is low.
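To make the idea of a synthetic probe concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not how any particular APM product works: the names `ProbeResult`, `http_fetch`, and `run_probe` are hypothetical, and "available" is assumed to mean an HTTP status in the 2xx or 3xx range.

```python
import time
import urllib.error
import urllib.request
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ProbeResult:
    """Outcome of one synthetic check (hypothetical structure)."""
    url: str
    available: bool
    status: Optional[int]
    elapsed_ms: float


def http_fetch(url: str, timeout: float) -> int:
    """Request the URL and return the HTTP status code."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as HTTPError; keep the code.
        return err.code


def run_probe(url: str,
              fetch: Callable[[str, float], int] = http_fetch,
              timeout: float = 10.0) -> ProbeResult:
    """Time a single request and classify it as available or not.

    Assumption: a 2xx or 3xx status counts as available. Any exception
    (DNS failure, refused connection, timeout) counts as unavailable.
    """
    start = time.perf_counter()
    try:
        status: Optional[int] = fetch(url, timeout)
        available = status is not None and 200 <= status < 400
    except Exception:
        status, available = None, False
    elapsed_ms = (time.perf_counter() - start) * 1000.0
    return ProbeResult(url, available, status, elapsed_ms)
```

A scheduler (cron, for example) could call `run_probe` against a login page every few minutes overnight, so that availability and response time are still recorded when no real users are generating traffic. The `fetch` parameter is injectable so the probe logic can be exercised without touching the network.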

Bottom Up Monitoring

Bottom Up Monitoring, also referred to as Infrastructure Monitoring, usually ties into an operations manager tool, which becomes the central collection point where event correlation happens. At a minimum, up/down monitoring should be in place for all nodes/servers within the environment. System automation is the key to the timeliness and accuracy of incidents created through the Trouble Ticket Interface.
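As a rough sketch of what automated up/down monitoring could look like, the Python below polls a list of nodes and emits one incident record each time a node goes from up to down. Everything here is an assumption made for illustration: the function names, the TCP-connect definition of "up", and the incident fields are hypothetical and do not represent any specific operations manager or trouble ticket interface.

```python
import socket
from typing import Callable, Dict, List, Tuple

Node = Tuple[str, int]  # (host, port)


def tcp_is_up(host: str, port: int, timeout: float = 2.0) -> bool:
    """Treat a node as up if a TCP connection can be opened (assumption)."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def poll_nodes(nodes: List[Node],
               last_state: Dict[Node, bool],
               is_up: Callable[[str, int], bool] = tcp_is_up) -> List[dict]:
    """Check every node once and return incidents for new outages only.

    A node seen for the first time is assumed to have been up. An incident
    is produced only on an up-to-down transition, so a node that stays
    down does not generate a duplicate incident on every poll.
    """
    incidents = []
    for host, port in nodes:
        up = is_up(host, port)
        was_up = last_state.get((host, port), True)
        if was_up and not up:
            incidents.append({
                "node": f"{host}:{port}",
                "summary": "Node down",
                "severity": "high",  # hypothetical field and value
            })
        last_state[(host, port)] = up
    return incidents
```

Opening an incident only on a state change is a very simple form of the correlation mentioned above. In practice the returned records would be handed to whatever ticketing integration is in place.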

Incident Management Process

The Incident Management Process as defined in ITIL is a foundational pillar supporting Application Performance Management (APM). In our situation, the Incident Management, Problem Management, and Change Management processes had already been established in the culture for a year before we began implementing the APM strategies.

A look into ITIL's Continual Service Improvement (CSI) model and the benefits of Application Performance Management indicates they are both focused on improvement, with APM defining toolsets that tie together specific processes in Service Design, Service Transition, and Service Operation.

Reporting Metrics

Capturing the raw data for analysis is essential for an APM strategy to be successful. It is important to arrive at a common set of metrics that you will collect, and then to standardize on a common view for presenting the real-time performance data.

Your best bet: Alert on the Averages and Profile with Percentiles. Use 5-minute averages for real-time performance alerting, and percentiles for overall application profiling and Service Level Management.
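The averages-versus-percentiles split can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, the samples are assumed to be `(unix_timestamp_seconds, response_time_ms)` pairs, the alert threshold is whatever you choose, and the percentile uses the simple nearest-rank method rather than any vendor's calculation.

```python
import math
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Sample = Tuple[float, float]  # (unix timestamp in seconds, response time in ms)


def five_minute_averages(samples: Iterable[Sample],
                         window_s: int = 300) -> Dict[int, float]:
    """Average response time per 5-minute window, keyed by window start."""
    buckets: Dict[int, List[float]] = defaultdict(list)
    for ts, ms in samples:
        buckets[int(ts // window_s) * window_s].append(ms)
    return {start: sum(vals) / len(vals)
            for start, vals in sorted(buckets.items())}


def windows_over_threshold(averages: Dict[int, float],
                           threshold_ms: float) -> List[int]:
    """Window starts whose average exceeds the alert threshold."""
    return [start for start, avg in averages.items() if avg > threshold_ms]


def percentile(values: Iterable[float], pct: float) -> float:
    """Nearest-rank percentile: the smallest value with at least pct
    percent of the observations at or below it."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]
```

Real-time alerting would run `windows_over_threshold` on the most recent windows, while a weekly or monthly profile would feed the full set of response times to `percentile` (the 95th, for example) for Service Level Management reporting.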

Conclusion

As you go deeper in your exploration of APM and begin sifting through the technical dogma (e.g. transaction tagging, script injection, application profiling, stitching engines, etc.) for key decision points, take a step back and ask yourself why you're doing this in the first place: to translate IT metrics into an End-User-Experience that provides value back to the business.

If you have questions on the approach and what you should focus on first with APM, see Prioritizing Gartner's APM Model for insight on some best practices from the field.

Larry Dragich is Director of Enterprise Application Services at the Auto Club Group.

You can contact Larry on LinkedIn

Larry Dragich of AAA Joins The BSM Blog

For a high-level view of a much broader technology space, refer to the slide show on BrightTALK.com, which describes “The Anatomy of APM - webcast” in more context.
