4 Tips for Dealing with All Those Event Alerts
July 10, 2013

Ariel Gordon

IT operations handles hundreds, or even thousands, of console messages day in and day out – including weekends. It’s an ongoing 24x7 battle. Data centers keep expanding and increasing in complexity, yet operations is still expected to manage the flood of event alerts pouring in.

Compounding the problem of sheer volume, these alert notifications typically use technical language that only domain experts can understand, and they arrive entirely without context.

So, let’s have a look at some tips that will help IT operations personnel deal with all of this by focusing on important events, while understanding their impact on delivery of business services.

1. Add meaning with enrichment rules

Turn cryptic technical messages into meaningful information by adding text that describes the event, including severity prioritization, owner and, if known, the service(s) impacted. The illustration below provides an example. This enrichment helps to clarify the impact of the event alert and provides guidance about the next steps to be taken.
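As a rough sketch of what an enrichment rule might look like in code, the Python below looks up a message code in a small table and attaches a readable summary, severity, owner and impacted services. The codes, owners and service names are all hypothetical, and a real event console would have its own rule format.

```python
# Hypothetical enrichment table: every code, owner, and service name below
# is invented for illustration and does not come from any real product.
ENRICHMENT_RULES = {
    "DISK-0042": {
        "summary": "Database volume is almost full; writes may start failing",
        "severity": "critical",
        "owner": "Database team",
        "services": ["Customer Portal"],
    },
    "NET-0007": {
        "summary": "Switch port is dropping packets",
        "severity": "warning",
        "owner": "Network team",
        "services": ["Supply Ordering"],
    },
}


def enrich(raw_message):
    """Return the raw message plus readable context from the first matching rule."""
    for code, context in ENRICHMENT_RULES.items():
        if code in raw_message:
            return {"raw": raw_message, **context}
    # No rule matched: keep the event, but mark it as lacking context.
    return {
        "raw": raw_message,
        "summary": raw_message,
        "severity": "unknown",
        "owner": "unassigned",
        "services": [],
    }
```

Calling `enrich("db01: DISK-0042 threshold exceeded")` would return the original text together with the summary, severity, owner and service list from the matching rule.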


2. Apply correlation rules

Apply correlation rules to help reduce redundant events displayed on the console. Use filtering rules to remove events below a specific impact level – or events that impact less important components such as test servers. It’s also possible to use de-duplication rules to reduce noise related to the same event.
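The filtering and de-duplication rules described above can be sketched in a few lines of Python. This is only an illustration under assumed field names (`host`, `severity`, `message`) and an assumed three-level severity scale; actual event management tools define their own schemas and rule syntax.

```python
# Assumed severity scale, lowest to highest (hypothetical).
SEVERITY_RANK = {"info": 0, "warning": 1, "critical": 2}


def reduce_noise(events, min_severity="warning", ignored_hosts=frozenset()):
    """Drop low-impact events, events from ignored hosts, and exact repeats."""
    threshold = SEVERITY_RANK[min_severity]
    seen = set()
    kept = []
    for event in events:
        # Filtering rule 1: remove events below the chosen impact level.
        if SEVERITY_RANK[event["severity"]] < threshold:
            continue
        # Filtering rule 2: remove events from less important components.
        if event["host"] in ignored_hosts:
            continue
        # De-duplication rule: keep only the first copy of the same event.
        key = (event["host"], event["message"])
        if key in seen:
            continue
        seen.add(key)
        kept.append(event)
    return kept
```

For example, passing `ignored_hosts={"test01"}` would suppress everything from a test server named `test01`, while repeated copies of the same message from a production host would collapse into a single console entry.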

3. Apply tools that define all business service infrastructure components and their interrelationships

Then, you’ll be able to understand the links between IT events and their associated context and impact on business services.
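One way to picture such a map is as a dependency graph that is walked from the failing component up to the business services that rely on it. The Python sketch below assumes a tiny, hand-written graph with invented component and service names; a real mapping tool would discover and maintain this data automatically.

```python
from collections import deque

# Hypothetical dependency map: component -> things that depend directly on it.
DEPENDENTS = {
    "switch-a": ["web-server-1"],
    "web-server-1": ["Customer Portal"],
    "db-server-2": ["Customer Portal", "Supply Ordering"],
}

# Hypothetical set of names that represent business services.
BUSINESS_SERVICES = {"Customer Portal", "Supply Ordering"}


def impacted_services(component):
    """Walk the graph breadth-first and collect every business service reached."""
    seen = {component}
    queue = deque([component])
    impacted = set()
    while queue:
        node = queue.popleft()
        if node in BUSINESS_SERVICES:
            impacted.add(node)
        for dependent in DEPENDENTS.get(node, []):
            if dependent not in seen:
                seen.add(dependent)
                queue.append(dependent)
    return impacted
```

With this map, an event on `switch-a` would be linked to the Customer Portal only, while an event on `db-server-2` would be linked to both services.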

4. Be proactive to understand the impact of changes in the IT infrastructure

It’s a truism in IT that 80 percent of problems originate from changes. Get in front of the event alerts caused by change by answering, before the change is made, questions such as “Will an upgrade to that problematic switch port take down the customer portal, or does it only affect ordering supplies?” Ensuring safer changes can eliminate many event alerts.
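A pre-change check along these lines could be as simple as the following Python sketch. It assumes a lookup table from component to supported services is already available (for instance, exported from a service-mapping tool) and that someone has decided which services count as critical. All names here are hypothetical.

```python
# Hypothetical lookup: component -> business services it supports.
SERVICES_BY_COMPONENT = {
    "switch-port-7": ["Customer Portal"],
    "switch-port-9": ["Supply Ordering"],
}

# Hypothetical list of services that must not go down during business hours.
CRITICAL_SERVICES = {"Customer Portal"}


def review_change(component):
    """Summarize which services a planned change to `component` could affect."""
    mapped = component in SERVICES_BY_COMPONENT
    services = SERVICES_BY_COMPONENT.get(component, [])
    critical_at_risk = sorted(s for s in services if s in CRITICAL_SERVICES)
    return {
        "component": component,
        # An unmapped component is a gap in the data, not proof of safety.
        "mapped": mapped,
        "services": services,
        "critical_at_risk": critical_at_risk,
        "needs_maintenance_window": bool(critical_at_risk),
    }
```

Here a change to `switch-port-7` would be flagged as needing a maintenance window because it supports the customer portal, whereas a change to `switch-port-9` would affect only supply ordering.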

Ariel Gordon is Chief Technology Officer and Co-Founder of Neebula.
