The Anatomy of APM – 4 Foundational Elements to a Successful Strategy
April 04, 2012

Larry Dragich
Auto Club Group

Share this

By embracing End-User-Experience (EUE) measurements as a key vehicle for demonstrating productivity, you build trust with your constituents in a very tangible way. The translation of IT metrics into business meaning (value) is what APM is all about.

The goal here is to simplify a complicated technology space by walking through a high-level view within each core element. I’m suggesting that the success factors in APM adoption center around the EUE and the integration touch points with the Incident Management process.

When looking at APM at 20,000 feet, four foundational elements come into view:

- Top Down Monitoring (RUM)


- Bottom Up Monitoring (Infrastructure)


- Incident Management Process (ITIL)


- Reporting (Metrics)


Top Down Monitoring

Top Down Monitoring is also referred to as Real-time Application Monitoring that focuses on the End-User-Experience. It has two has two components, Passive and Active. Passive monitoring is usually an agentless appliance which leverages network port mirroring. This low risk implementation provides one of the highest values within APM in terms of application visibility for the business.

Active monitoring, on the other hand, consists of synthetic probes and web robots which help report on system availability and predefined business transactions. This is a good complement when used with passive monitoring to help provide visibility on application health during off peak hours when transaction volume is low.

Bottom Up Monitoring

Bottom Up Monitoring is also referred to as Infrastructure Monitoring which usually ties into an operations manager tool and becomes the central collection point where event correlation happens. Minimally, at this level up/down monitoring should be in place for all nodes/servers within the environment. System automation is the key component to the timeliness and accuracy of incidents being created through the Trouble Ticket Interface.

Incident Management Process

The Incident Management Process as defined in ITIL is a foundational pillar to support Application Performance Management (APM). In our situation, Incident Management, Problem Management, and Change Management processes were already established in the culture for a year prior to us beginning to implement the APM strategies.

A look into ITIL's Continual Service Improvement (CSI) model and the benefits of Application Performance Management indicates they are both focused on improvement, with APM defining toolsets that tie together specific processes in Service Design, Service Transition, and Service Operation.

Reporting Metrics

Capturing the raw data for analysis is essential for an APM strategy to be successful. It is important to arrive at a common set of metrics that you will collect and then standardize on a common view on how to present the real-time performance data.

Your best bet: Alert on the Averages and Profile with Percentiles. Use 5 minute averages for real-time performance alerting, and percentiles for overall application profiling and Service Level Management.

Conclusion

As you go deeper in your exploration of APM and begin sifting through the technical dogma (e.g. transaction tagging, script injection, application profiling, stitching engines, etc.) for key decision points, take a step back and ask yourself why you're doing this in the first place: To translate IT metrics into an End-User-Experience that provides value back to the business.

If you have questions on the approach and what you should focus on first with APM, see Prioritizing Gartner's APM Model for insight on some best practices from the field.

Larry Dragich is Director of Enterprise Application Services at the Auto Club Group.

You can contact Larry on LinkedIn

Larry Dragich of AAA Joins The BSM Blog

For a high-level view of a much broader technology space refer to slide show on BrightTALK.com which describes “The Anatomy of APM - webcast” in more context.

Share this

The Latest

August 29, 2016

The cloud revolution has affected all facets of the IT realm, including network and application monitoring. SNMP monitoring gives us the status of our devices, but doesn’t capture the end-user experience. We need to know what users experience regardless of what device, network and ISP connects them to cloud applications ...

August 26, 2016

A little more than 70 percent of federal IT decision makers surveyed said their agency runs important applications on outdated IT systems, according to Dell's new State of IT Trends 2016 ...

August 25, 2016

Despite organizations pursuing multi-cloud strategies to ensure business continuity, resilience and performance, research conducted by Turbonomic and Verizon found that cloud purchasing decisions are still almost entirely focused on pricing ...

August 24, 2016

While service catalogs are not new, they are becoming increasingly critical to enterprises seeking to optimize IT efficiencies, service delivery and business outcomes. They are also a way of supporting both enterprise and IT services, as well as optimizing IT for cost and value with critical metrics and insights. In this blog, we'll look at how and why service catalogs are becoming ever more important both to IT organizations and to the businesses and organizations they serve ...

August 23, 2016

What is needed to create a next-generation network management tool? Nothing less than the development of a sophisticated network-aware orchestration engine that is able to detect any interdependencies, resolve them and deploy network policies automatically over the network ...

August 22, 2016

The challenge today for network operations (NetOps) is how to maintain and evolve the network while demand for network services continues to grow. Software-Defined Networking (SDN) promises to make the network more agile and adaptable. Various solutions exist, yet most are missing a layer to orchestrate new features and policies in a standardized, automated and replicable manner while providing sufficient customization to meet enterprise-level requirements ...

August 19, 2016

ScaleArc's Summer Blockbuster Survey found that 62 percent of Americans said they would be upset if they were purchasing movie tickets and the site or app went down, and 90 percent agreed that movie ticketing websites and apps should have no downtime this summer ...

August 18, 2016

This blog talks about end-user expectations in terms of felt or experienced performance of applications or desktops delivered by technology which is called VDI, Desktop Virtualization, Remote Desktop, App Virtualization …

August 17, 2016

Monitoring your middleware platforms with a consolidated monitoring application has been shown over and over to reduce the frequency and duration of severity 1 and 2 incidents and prevent losses of revenue attributed to downtime. However, making a strong business care for end-end monitoring and middleware monitoring can be challenging and can present unique learning opportunities. Here are some recommendations to help you make a better business case ...

August 16, 2016

Organizations are embracing IoT as part of their strategic initiatives, with over 70% of respondents indicating that IoT is “essential” or “important” to their organization’s business and technical strategies, according to new research by Enterprise Management Associates (EMA), titled The Rise of the Internet of Things: Connecting Our World One Device at a Time ...