OpsRamp announced OpsQ Observed Mode to build confidence in machine learning models for IT event and performance analysis, as part of the Summer 2019 Release which also introduces automated alert suppression to reduce human time spent on first-response to alerts, continuous learning-based alert escalation using live event data, and new infrastructure monitoring capabilities for cloud native environments.
According to OpsRamp’s 2019 State of AIOps report, 67% of respondents have concerns about the relevance and reliability of the insights delivered by artificial intelligence for IT operations (AIOps) tools. OpsQ Observed Mode enables IT teams to assess the accuracy of machine-learning-driven correlation decisions in preview mode, enhancing the integrity of data for improved decision-making.
Highlights of the OpsRamp Summer 2019 release include:
Service-Centric AIOps: OpsQ is OpsRamp’s intelligent event management, alert correlation, and remediation solution. New OpsQ capabilities help IT teams drive faster incident prioritization and rapid mean-time-to-resolution (MTTR) for dynamic infrastructure workloads and include:
- OpsQ Observed Mode: OpsQ Observed Mode helps incident management teams assess the accuracy of the OpsRamp machine learning algorithms in a live production environment before they take effect. Observed Mode creates shadow inferences that show alert correlation decisions that OpsQ would have made if enabled.
- Learning-Based Auto-Alert Suppression: OpsQ looks for recurring alert patterns in production environments and suppresses those alerts that occur at a predictable cadence. OpsQ uses seasonality-based and attribute-based auto-alert suppression techniques as a first-response mechanism so that incident responders no longer have to acknowledge, process, and triage every alert that they receive.
- Automatic Resource Creation from Third-Party Events: OpsQ now has the ability to auto-extract metadata for resources managed by other tools and use this information to automatically contextualize future alerts from these resources.
- Continuous Learning for Alert Escalation: Alert escalation policies support a continuous learning option for auto-incident creation. The OpsRamp platform continuously re-trains its machine learning models using live alert data, adapting to dynamic environments.
Service and Topology Maps: The Summer 2019 Release introduces new impact visibility and service context features that deliver dynamic relationship data for public cloud services and actionable insights for understanding cross-site interconnections.
- Cloud Topology for AWS: The new AWS topology map shows dependency information for cloud resources such as AWS EC2, VPC, RDS, or ELB instances so that DevOps teams can keep track of all the different moving parts in their public cloud estate.
- Cross-Site Connection Topology: OpsRamp network topology maps now incorporate routing layer relationships (BGP and OSPF) across WAN links.
Cloud Native Discovery and Monitoring: DevOps and site reliability engineering (SRE) teams can now monitor popular open source applications used in cloud native environments and access relevant performance insights for Mesosphere and Azure Stack in the OpsRamp platform.
- Out-of-the-Box Kubernetes Dashboards: OpsRamp can automatically create performance management dashboards for Kubernetes environments. IT teams can gain instant visibility into the health of containerized deployments by tracking cluster, pod, and node level metrics.
- Expanded Application Monitoring: OpsRamp now provides agentless monitoring for commonly used applications (ActiveMQ, Apache Spark, Apache Solr, CockroachDB, Couchbase, Apache CouchDB, Elastic Search, Fluentd, Neo4j Graph Platform, RabbitMQ) within the cloud and cloud native stacks.
- Mesosphere: OpsRamp can now discover and monitor Mesosphere-based cloud native environments. The integration captures performance metrics for Mesos master and agent nodes that help optimize and scale modern enterprises apps built on dynamic infrastructure.
- Azure Stack: OpsRamp can discover and monitor network connections, virtual networks and load balancers in an Azure Stack environment. Cloud admins can analyze the availability and performance of their hybrid infrastructure in Azure Stack through the integration.
“Our customers have told us that they’d like to see how AIOps inferences proactively detect, diagnose, and address service continuity issues. OpsQ Observed Mode is a no-risk option for IT operations and DevOps teams to assess the accuracy and power of machine intelligence-driven event management, ” said Mahesh Ramachandran, VP of Product Management for OpsRamp. “The Summer 2019 Release provides modern IT infrastructure teams the real-time intelligence to fix visibility gaps in their hybrid and multi-cloud environments.”
The OpsRamp Summer 2019 Release also includes new synthetic monitoring capabilities, service map enhancements, bulk export of operational data for data mining, and monitoring of integration failures.
The Latest
In a fast-paced industry where customer service is a priority, the opportunity to use AI to personalize products and services, revolutionize delivery channels, and effectively manage peaks in demand such as Black Friday and Cyber Monday are vast. By leveraging AI to streamline demand forecasting, optimize inventory, personalize customer interactions, and adjust pricing, retailers can have a better handle on these stress points, and deliver a seamless digital experience ...
Broad proliferation of cloud infrastructure combined with continued support for remote workers is driving increased complexity and visibility challenges for network operations teams, according to new research conducted by Dimensional Research and sponsored by Broadcom ...
New research from ServiceNow and ThoughtLab reveals that less than 30% of banks feel their transformation efforts are meeting evolving customer digital needs. Additionally, 52% say they must revamp their strategy to counter competition from outside the sector. Adapting to these challenges isn't just about staying competitive — it's about staying in business ...
Leaders in the financial services sector are bullish on AI, with 95% of business and IT decision makers saying that AI is a top C-Suite priority, and 96% of respondents believing it provides their business a competitive advantage, according to Riverbed's Global AI and Digital Experience Survey ...
SLOs have long been a staple for DevOps teams to monitor the health of their applications and infrastructure ... Now, as digital trends have shifted, more and more teams are looking to adapt this model for the mobile environment. This, however, is not without its challenges ...
Modernizing IT infrastructure has become essential for organizations striving to remain competitive. This modernization extends beyond merely upgrading hardware or software; it involves strategically leveraging new technologies like AI and cloud computing to enhance operational efficiency, increase data accessibility, and improve the end-user experience ...
AI sure grew fast in popularity, but are AI apps any good? ... If companies are going to keep integrating AI applications into their tech stack at the rate they are, then they need to be aware of AI's limitations. More importantly, they need to evolve their testing regiment ...
If you were lucky, you found out about the massive CrowdStrike/Microsoft outage last July by reading about it over coffee. Those less fortunate were awoken hours earlier by frantic calls from work ... Whether you were directly affected or not, there's an important lesson: all organizations should be conducting in-depth reviews of testing and change management ...
In MEAN TIME TO INSIGHT Episode 11, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses Secure Access Service Edge (SASE) ...
On average, only 48% of digital initiatives enterprise-wide meet or exceed their business outcome targets according to Gartner's annual global survey of CIOs and technology executives ...