Datadog announced an expanded strategic partnership with Google Cloud, which enables Google Cloud customers to proactively observe and secure their cloud-native and hybrid applications within Datadog's unified platform.
As part of the expanded partnership and integrations, Datadog integrates with Vertex AI, allowing AI ops teams and developers to monitor, analyze and optimize the performance of their machine learning models in production.
"Google Cloud continues to be a key partner for Datadog as we jointly help global businesses observe and secure their cloud applications," said Yrieix Garnier, VP of Product at Datadog. "The new Vertex AI integration expands this partnership and gives AI and ML developers full observability into their production applications built on Vertex AI. With out-of-the-box dashboards and real-time monitors, customers can get started quickly and ensure their models are performing at an optimal level while delivering predictions responsively at scale and without errors."
"Generative AI is fundamentally changing how many businesses operate, fueling a new era of cloud that can benefit virtually every area of an organization," said Kevin Icchpurani, Corporate VP, Global Partner Ecosystem & Channels at Google Cloud. "By applying Vertex AI, Datadog can help AI teams improve how they monitor and analyze the performance of machine learning models, ensuring they are functioning correctly and creating optimal value."
Datadog's integration with Vertex AI provides developers full observability on the prediction performance and resource utilization of their custom AI/ML models. The integration provides an out-of-the-box dashboard with prediction counts, latency, errors and resource (CPU/Memory/Network) utilization grouped by deployed models so teams can compare model performance side-by-side in production environments. It also helps detect data anomalies in order to maintain the reliability and robustness of machine learning applications.
Other new and expanded Google Cloud integrations that were recently announced include:
- Serverless monitoring: Datadog now offers in-depth support for Google Cloud Run—the leading serverless compute technology on Google Cloud. With native distributed tracing across all runtimes and the ability to collect custom metrics and logs, Datadog provides deep insights into customers' Cloud Run workloads as well as fully managed APIs, queues, streams and data stores.
- Google Cloud Ready - Cloud SQL: Datadog has earned the Google Cloud Ready designation for the Google Cloud SQL integration, providing visibility into the performance and health of Cloud SQL to customers. This integration monitors throughput, memory and availability metrics in customers' databases from MySQL, PostgreSQL and SQL Server.
- Google Security Command Center: Customers can now send their Google Cloud Security Command Center findings to Datadog, including vulnerabilities, threats and errors from containers and virtual machines. Using Datadog Cloud SIEM, customers can automatically generate signals and perform investigations.
- Quick setup: Datadog's new setup experience allows Google Cloud customers to get started in just seconds so they can monitor their entire Google Cloud environment, even when there are thousands of projects. New projects can also be auto-discovered to ensure complete and seamless monitoring coverage. With out-of-the-box dashboards and real-time monitors on over 30+ Google Cloud integrations, customers can begin monitoring their services in just a few clicks.
These integrations are available now.
The Latest
In the heat of the holiday online shopping rush, retailers face persistent challenges such as increased web traffic or cyber threats that can lead to high-impact outages. With profit margins under high pressure, retailers are prioritizing strategic investments to help drive business value while improving the customer experience ...
In a fast-paced industry where customer service is a priority, the opportunity to use AI to personalize products and services, revolutionize delivery channels, and effectively manage peaks in demand such as Black Friday and Cyber Monday are vast. By leveraging AI to streamline demand forecasting, optimize inventory, personalize customer interactions, and adjust pricing, retailers can have a better handle on these stress points, and deliver a seamless digital experience ...
Broad proliferation of cloud infrastructure combined with continued support for remote workers is driving increased complexity and visibility challenges for network operations teams, according to new research conducted by Dimensional Research and sponsored by Broadcom ...
New research from ServiceNow and ThoughtLab reveals that less than 30% of banks feel their transformation efforts are meeting evolving customer digital needs. Additionally, 52% say they must revamp their strategy to counter competition from outside the sector. Adapting to these challenges isn't just about staying competitive — it's about staying in business ...
Leaders in the financial services sector are bullish on AI, with 95% of business and IT decision makers saying that AI is a top C-Suite priority, and 96% of respondents believing it provides their business a competitive advantage, according to Riverbed's Global AI and Digital Experience Survey ...
SLOs have long been a staple for DevOps teams to monitor the health of their applications and infrastructure ... Now, as digital trends have shifted, more and more teams are looking to adapt this model for the mobile environment. This, however, is not without its challenges ...
Modernizing IT infrastructure has become essential for organizations striving to remain competitive. This modernization extends beyond merely upgrading hardware or software; it involves strategically leveraging new technologies like AI and cloud computing to enhance operational efficiency, increase data accessibility, and improve the end-user experience ...
AI sure grew fast in popularity, but are AI apps any good? ... If companies are going to keep integrating AI applications into their tech stack at the rate they are, then they need to be aware of AI's limitations. More importantly, they need to evolve their testing regiment ...
If you were lucky, you found out about the massive CrowdStrike/Microsoft outage last July by reading about it over coffee. Those less fortunate were awoken hours earlier by frantic calls from work ... Whether you were directly affected or not, there's an important lesson: all organizations should be conducting in-depth reviews of testing and change management ...
In MEAN TIME TO INSIGHT Episode 11, Shamus McGillicuddy, VP of Research, Network Infrastructure and Operations, at EMA discusses Secure Access Service Edge (SASE) ...