Grafana Labs Adds Metrics Cost Management to Grafana Cloud
May 09, 2023
Share this

Grafana Labs announced updates to its fully managed Grafana Cloud observability platform: The new Adaptive Metrics feature, which enables teams to aggregate unused and partially used time series data to lower costs, is now available for broader public access.

This feature leverages enhanced insights into metrics usage recently added to Grafana Cloud's Cardinality Management dashboards, which are now available in all Grafana Cloud tiers, both free and paid. Together these advancements, powered by the open source project Grafana Mimir, help organizations rapidly scale at cloud native pace while optimizing metric cardinality and controlling costs.

Grafana Cloud now offers:

- Grafana Cloud’s Cardinality Management dashboards now include insights into the usage of high cardinality metrics, to help distinguish between metrics that are being used and metrics that are unused. The ability to identify high cardinality metrics that are unused in dashboards, queries, recording rules, and alerting rules results in actionable outcomes for SRE or centralized observability teams looking to confidently make data-driven decisions to reduce metrics spend without impacting observability. The Cardinality Management dashboards were first introduced late last year to Grafana Cloud Pro and Advanced customers, but now are generally available to all Grafana Cloud users, including those on the Grafana Cloud Free tier.

- Grafana Cloud's Adaptive Metrics feature takes insights about usage from the Cardinality Management dashboards one step further: It gives users better control of spend on observability metrics by enabling aggregation of unused or partially used metrics. (With partially used metrics, only a subset of the metric’s labels are used.) The Adaptive Metrics aggregation engine transforms these metrics into lower cardinality versions of themselves at ingestion. Unused or partially used labels are stripped from incoming metrics, reducing the total count of time series persisted – and thus the user’s monthly bill. Adaptive Metrics recommends aggregations based on an organization's historic usage patterns, and users can choose which aggregation rules to apply. Dashboards, alerts, and historic queries are guaranteed to continue to work as they did before aggregation, with no rewrites needed. If usage needs change, users can immediately revert back to the unaggregated version of a metric and get the extra detail they need going forward.

Based on results reported by early users, Grafana Cloud Adaptive Metrics can eliminate an estimated 20-50% of an organization’s time series with no perceived impact on their ability to observe their systems.

Grafana Cloud Adaptive Metrics is now available in a public access program for all Grafana Cloud tiers.

"While we've seen the value that Prometheus brings to organizations, we've also seen its popularity lead to rapid adoption and uncontrolled costs," said Tom Wilkie, CTO at Grafana Labs. "In fact, we even had this problem at Grafana Labs, running our own Prometheus monitoring for Grafana Cloud. One of our clusters had grown to over 100 million active series, and 50% of them were unused. We started thinking about how we could solve this problem, and Adaptive Metrics was the answer. We've reduced that cluster by 40%, and we're excited to share this powerful capability with our Grafana Cloud users.”

Share this

The Latest

November 08, 2024

In MEAN TIME TO INSIGHT Episode 11, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses Secure Access Service Edge (SASE) ...

November 07, 2024

On average, only 48% of digital initiatives enterprise-wide meet or exceed their business outcome targets according to Gartner's annual global survey of CIOs and technology executives ...

November 06, 2024

Artificial intelligence (AI) is rapidly reshaping industries around the world. From optimizing business processes to unlocking new levels of innovation, AI is a critical driver of success for modern enterprises. As a result, business leaders — from DevOps engineers to CTOs — are under pressure to incorporate AI into their workflows to stay competitive. But the question isn't whether AI should be adopted — it's how ...

November 05, 2024

The mobile app industry continues to grow in size, complexity, and competition. Also not slowing down? Consumer expectations are rising exponentially along with the use of mobile apps. To meet these expectations, mobile teams need to take a comprehensive, holistic approach to their app experience ...

November 04, 2024

Users have become digital hoarders, saving everything they handle, including outdated reports, duplicate files and irrelevant documents that make it difficult to find critical information, slowing down systems and productivity. In digital terms, they have simply shoved the mess off their desks and into the virtual storage bins ...

November 01, 2024

Today we could be witnessing the dawn of a new age in software development, transformed by Artificial Intelligence (AI). But is AI a gateway or a precipice? Is AI in software development transformative, just the latest helpful tool, or a bunch of hype? To help with this assessment, DEVOPSdigest invited experts across the industry to comment on how AI can support the SDLC. In this epic multi-part series to be posted over the next several weeks, DEVOPSdigest will explore the advantages and disadvantages; the current state of maturity and adoption; and how AI will impact the processes, the developers, and the future of software development ...

October 31, 2024

Half of all employees are using Shadow AI (i.e. non-company issued AI tools), according to a new report by Software AG ...

October 30, 2024

On their digital transformation journey, companies are migrating more workloads to the cloud, which can incur higher costs during the process due to the higher volume of cloud resources needed ... Here are four critical components of a cloud governance framework that can help keep cloud costs under control ...

October 29, 2024

Operational resilience is an organization's ability to predict, respond to, and prevent unplanned work to drive reliable customer experiences and protect revenue. This doesn't just apply to downtime; it also covers service degradation due to latency or other factors. But make no mistake — when things go sideways, the bottom line and the customer are impacted ...

October 28, 2024

Organizations continue to struggle to generate business value with AI. Despite increased investments in AI, only 34% of AI professionals feel fully equipped with the tools necessary to meet their organization's AI goals, according to The Unmet AI Needs Surveywas conducted by DataRobot ...