
Datadog launches domain-specific AI agents & LLM tools
Datadog has announced the addition of three domain-specific AI agents to its generative AI assistant, Bits AI, together with new tools for monitoring and managing large language model (LLM) and agentic AI deployments.
New AI agents
The company has introduced Bits AI SRE, Bits AI Dev Agent, and Bits AI Security Analyst, each configured to serve specific engineering, operations, and security functions. These agents are designed to support real-time incident response, DevOps tasks, and security workflows for development, security, and operations teams.
The AI agents operate on a shared system of core tasks, including data querying, anomaly analysis, and infrastructure scaling. This architecture allows Datadog to roll out new agents efficiently while maintaining consistency in the user experience. The system integrates a broad set of observability data, enabling precise insights and actions for managing risks within cloud-based applications.
Yanbing Li, Chief Product Officer at Datadog, commented on the company's approach: Datadog is uniquely positioned to deliver value with AI as a platform that has a wealth of clean, rich data—we process trillions of data points and are embedded in our customers' critical engineering, developer and security workflows. With these advancements in AI reasoning and multi-modality, we've gone beyond helping organizations understand their availability, security, performance and reliability. We now enable human-in-the-middle workflows by guiding customers on what to look for and where to start looking, and augment their ability to take action.
Bits AI SRE, which is now in limited availability, acts as an on-call responder for incidents by performing early triage and providing investigation findings before human responders intervene. It allocates incidents, produces real-time summaries, and generates initial post-mortem drafts to save teams time.
Bits AI Dev Agent, currently in preview, identifies code issues, suggests fixes, and can open pull requests directly within the source control management systems organisations use. Bits AI Security Analyst, also in preview, automatically investigates cloud security signals, conducts in-depth threat investigations, and produces actionable resolution recommendations, aiming to reduce response times for security incidents.
Darren Trzynka, Senior Cloud Architect at Thomson Reuters, commented on Bits AI's impact: At Thomson Reuters, we're focused on maximizing operational efficiency and accelerating innovation at scale through generative AI solutions. Bits AI allows operations and downstream platform teams to receive the full context of the investigation—from the initial monitor trigger to conclusion—driving down resolution time significantly freeing them up to do more.
Additional Applied AI features
The updates include two new features in preview. Proactive App Recommendations analyses telemetry collected by Datadog to suggest performance improvements or actions, such as optimising slow queries and addressing code issues, before users are impacted. The APM Investigator helps engineers troubleshoot latency spikes by automating bottleneck identification and recommending fixes.
LLM Observability suite announced
Datadog has also released a suite of tools designed to provide observability for agentic AI—software agents built with LLMs and similar technologies—in production environments. The new products include AI Agent Monitoring, LLM Experiments, and AI Agents Console.
Yrieix Garnier, Vice President of Product at Datadog, addressed the motivations behind these offerings: A recent study found only 25 percent of AI initiatives are currently delivering on their promised ROI—a troubling stat given the sheer volume of AI projects companies are pursuing globally. Today's launches aim to help improve that number by providing accountability for companies pushing huge budgets toward AI projects. The addition of AI Agent Monitoring, LLM Experiments and AI Agents Console to our LLM Observability suite gives our customers the tools to understand, optimize and scale their AI investments.
AI Agent Monitoring, now generally available, provides a mapped overview of each agent's decision-making route, including inputs, tool calls, and outputs, displayed in an interactive graph. This enables engineers to diagnose latency spikes or unexpected behaviours and connect them to quality, security, and cost measures across distributed systems.
Mistral AI's Co-founder and CTO, Timothée Lacroix, provided further industry perspective: Agents represent the evolution beyond chat assistants, unlocking the potential of generative AI. As we equip these agents with more tools, comprehensive observability is essential to confidently transition use cases into production. Our partnership with Datadog ensures teams have the visibility and insights needed to deploy agentic solutions at scale.
LLM Experiments, in preview, enables users to compare the effects of changes to prompts or models using datasets from live or uploaded sources. This aims to support quantifiable improvements in cost, response accuracy, and throughput, and prevent unintended regressions in AI application performance.
Michael Gerstenhaber, Vice President of Product at Anthropic, commented: AI agents are quickly graduating from concept to production. Applications powered by Claude 4 are already helping teams handle real-world tasks in many domains, from customer support to software development and R&D. As these agents take on more responsibility, observability becomes key to ensuring they behave safely, deliver value, and stay aligned with user and business goals. We're very excited about Datadog's new LLM Observability capabilities that provide the visibility needed to scale these systems with confidence.
Datadog has also introduced AI Agents Console, currently in preview, to allow organisations to centrally oversee both in-house and third-party AI agents, track their usage and impact, and monitor for potential security or compliance issues as external agents are embedded into critical business workflows.
Armita Peymandoust, Senior Vice President, Software Engineering at Salesforce, said: As enterprises scale digital labour, having clear visibility into how AI agents drive business impact has become mission critical. Customers are already seeing strong success with their AI deployments using Salesforce's Agentforce, which is built on a foundation of openness and trust. That foundation is further strengthened by our partner ecosystem that provides our customers even greater availability to tailored solutions that help them manage their AI agents confidently. Datadog's latest advances in deep observability will further support our vision and unlock another level of AI agent transparency and scale for organizations.
Hashtags

Try Our AI Features
Explore what Daily8 AI can do for you:
Comments
No comments yet...
Related Articles


Scoop
3 days ago
- Scoop
Datadog Expands Log Management Offering With New Long-Term Retention, Search And Data Residency Capabilities
Datadog, Inc. (NASDAQ: DDOG), the monitoring and security platform for cloud applications, today at DASH announced new capabilities in its log management suite, which are designed to help organisations optimise logging costs at scale and meet the stringent data retention, auditability and data residency requirements of regulated industries. Logs are critical for threat detection, incident response and audit trails. However, lack of flexibility, high costs and data retention limitations remain roadblocks for security and compliance teams. Financial services, healthcare and insurance companies face similar challenges, having to comply with regulations and maintain full control over sensitive operational data, including their logs. Likewise, organisations operating under regional data residency laws or internal security policies are often required to store data within controlled environments, whether on-premises or in-region cloud infrastructure. These organisations need to remain compliant while having a scalable and efficient log management strategy. Traditional solutions, however, often introduce high costs, operational overhead and fragmented workflows. At its DASH conference in 2023, Datadog launched Flex Logs, which has since become one of its fastest-growing products. Flex Logs decouples the costs of log storage from the costs of querying. It provides both short- and long-term log retention for a nominal monthly fee without sacrificing visibility, enabling streamlined correlation between all of an organisation's logs, metrics and traces. To help companies meet data residency regulations, policies and preferences—while further optimising cost and efficiency—Datadog has launched new log management capabilities that build on the foundation set by Flex Logs. Datadog's latest enhancements enable organisations to support modern SIEM and security workflows while maintaining full visibility, cost consciousness and operational efficiency: Archive Search queries logs from customer-owned cold storage without requiring re-indexing. Archived logs can be searched the same way as logs under retention in the Log Explorer without introducing new tools or extra training. Datadog keeps the user experience consistent, regardless of the age of logs. Flex Frozen is a new storage tier extending log retention to over seven years, eliminating the need for managing and securing external archives. Built for audit-heavy, compliance-driven environments, Flex Frozen simplifies data retention by keeping logs inside Datadog in order to reduce overhead, simplify reporting and analytics, and improve accessibility. CloudPrem enables enterprises to deploy Datadog's indexing and search capabilities within their own infrastructure. Whether it's due to regional data residency laws or internal compliance mandates, customers can now keep their logs local—while continuing to use the Datadog UI and workflows they trust. 'As compliance standards grow more complex and global data regulations tighten, organisations face mounting pressure to retain log data longer, search it faster and keep it where it belongs,' said Michael Whetten, VP of Product at Datadog. 'With today's launches, Datadog makes it easier to manage logs, control their costs and stay compliant without sacrificing performance, accessibility or the user experience.'


Scoop
3 days ago
- Scoop
Datadog's Internal Developer Portal Boosts Engineering Autonomy And Helps Ship Code With Confidence
Press Release – Datadog Datadog IDP accelerates incident response by bringing a live, centralised engineering knowledge base into every incident for faster triage, better decision making and improved coordination. AUCKLAND – JUNE 12, 2025 – Datadog, Inc. (NASDAQ: DDOG), the monitoring and security platform for cloud applications, today at DASH launched its Internal Developer Portal (IDP), which is the first and only developer portal built on live observability data. Engineering teams are under pressure to ship faster while still meeting stricter standards to keep their code reliable, secure, cost effective and compliant with legal, regulatory and company policies. Developers must navigate an expanding set of requirements—including code quality, testing, security scans, infrastructure configurations, observability and compliance. At the same time, they need to understand the systems and services their code depends on, who owns these services, and how they're performing in real time. As this complexity and cognitive load grow, developers increasingly rely on platform engineers to unblock them, which stretches resources for both teams and slows software delivery across the organisation. Datadog IDP gives developers the autonomy to ship quickly and confidently—while meeting production standards and keeping pace with constantly changing systems. Unlike static portals that rely on manual upkeep, Datadog IDP builds on its APM product suite to automatically map services and dependencies, and bring Datadog's real-time performance, service ownership and engineering knowledge together in one place. The new product enables developers to build, test, deploy and monitor software with self-service actions and built-in delivery guardrails, while providing platform engineers with scorecards to help them meet reliability, security and monitoring standards. Datadog IDP accelerates incident response by bringing a live, centralised engineering knowledge base into every incident for faster triage, better decision making and improved coordination. Engineers can focus on solving issues—rather than searching for them across disparate systems—by leveraging these capabilities as part of Datadog's unified platform: Software Catalog: A live system of record showing what software is running, who is responsible for it, and how it is performing across an organisation. This record is automatically synced to live telemetry collected in Datadog, so teams can easily find services, dependencies and their performance metrics, along with critical engineering context such as owners, on-call rotations, source code, runbooks, dashboards and documentation. Self-Service Actions: Pre-built, pre-approved templates powered by Datadog's App Builder and Workflow Automation make it quick and easy for developers to accomplish tasks—like scaffolding a new service, provisioning infrastructure resources or triggering remediation actions—independently while meeting internal requirements. Scorecards: A set of out-of-the-box and custom pass/fail rules that allow platform engineers and engineering managers to track compliance with reliability, security, observability, cost, and other standards across services and teams. Engineering Reports: Out-of-the-box visibility into engineering reliability, software delivery performance and compliance with engineering standards, while offering actionable, personalised views for developers, team leads and executives. 'Datadog's IDP brings together both observed and declared system states, as well as existing systems of record. This combination shows not only developer intention but also what is actually in production. Whether developers onboard new teams or are tasked with complex projects such as migrating code from EC2 to Kubernetes, Datadog automatically provides visibility into their systems and reflects changes as they are being made—without stale metadata or manual syncing,' said Michael Whetten, VP of Product at Datadog. 'Datadog IDP empowers developers to collaborate more effectively and deliver software that meets their organisation's standards, at the pace that is expected from them.' Datadog IDP's service ownership and other information are available across Datadog's unified platform. Status Pages, for example, leverages the same ownership metadata populated through IDP to make it easy to communicate incident scope and impact to stakeholders. And on-call engineers can now query service owners, recent changes and other critical information hands-free from IDP for faster investigations using a Voice Interface. To learn more about Datadog IDP, please visit: Datadog IDP was announced during the keynote at DASH, Datadog's annual conference. The replay of the keynote is available here. During DASH, Datadog also announced launches in AI Observability, Applied AI, AI Security and Log Management. About Datadog Datadog is the observability and security platform for cloud applications. Our SaaS platform integrates and automates infrastructure monitoring, application performance monitoring, log management, user experience monitoring, cloud security and many other capabilities to provide unified, real-time observability and security for our customers' entire technology stack. Datadog is used by organisations of all sizes and across a wide range of industries to enable digital transformation and cloud migration, drive collaboration among development, operations, security and business teams, accelerate time to market for applications, reduce time to problem resolution, secure applications and infrastructure, understand user behavior and track key business metrics.


Techday NZ
4 days ago
- Techday NZ
Datadog unveils IDP to boost developer autonomy & speed
Datadog has introduced its Internal Developer Portal (IDP), billed as the first developer portal built on live observability data, aiming to support engineering teams under increasing demands for faster and more reliable software delivery. Engineering teams reportedly face rising pressure to deliver code that is not only fast and secure but also compliant with legal, regulatory, and internal policies. In this environment, developers are expected to manage a broad span of requirements, including code quality, testing, security scans, infrastructure configurations, observability, and compliance—while also understanding dependencies and real-time system performance. Increasing system complexity and corresponding cognitive load mean that developers increasingly depend on platform engineers to resolve bottlenecks, which, according to Datadog, can slow down software delivery as both groups tackle resource constraints. According to the company, the Datadog IDP is designed to grant developers greater autonomy, enabling them to ship updates quickly while adhering to established standards. The IDP relies on Datadog's Application Performance Monitoring (APM) suite to automatically map services and dependencies. This creates a real-time, unified view of performance, service ownership, and relevant engineering information. The product allows developers to build, test, deploy, and monitor software through self-service actions that include built-in guardrails for delivery. Meanwhile, platform engineers can use scorecards to track compliance with criteria such as reliability, security, and monitoring standards. Capabilities Datadog IDP incorporates several core features designed to support these objectives. The Software Catalog offers a continually updated record of organisational software, including ownership, real-time performance metrics, and links to documentation, dashboards, and source code. The catalog is automatically synchronized to Datadog's telemetry stream. Self-service actions are provided via pre-built templates, facilitating tasks such as provisioning infrastructure or triggering remediation steps without the need for direct intervention from platform engineers. These templates are powered by Datadog's App Builder and Workflow Automation tools. Scorecards, part of the IDP, allow for the setting and monitoring of pass/fail rules in areas such as reliability, security, observability, and cost, with options for both standard and custom criteria. Engineering Reports provide visibility into reliability, performance, and compliance status, supplying targeted views for team leads, developers, and executives. "Datadog's IDP brings together both observed and declared system states, as well as existing systems of record. This combination shows not only developer intention but also what is actually in production. Whether developers onboard new teams or are tasked with complex projects such as migrating code from EC2 to Kubernetes, Datadog automatically provides visibility into their systems and reflects changes as they are being made—without stale metadata or manual syncing," said Michael Whetten, VP of Product at Datadog. "Datadog IDP empowers developers to collaborate more effectively and deliver software that meets their organisation's standards, at the pace that is expected from them." The company states that IDP also enhances incident response by providing a live, central knowledge base for quicker triage and decision making during service outages or other technical incidents. This information is integrated with other tools across the Datadog platform, such as Status Pages, which uses the same ownership metadata to communicate incident scope and impact to stakeholders. Additional functionality includes a voice interface, enabling on-call engineers to query service owners, review recent changes, and access other relevant information hands-free for faster diagnostics and investigations using data from the IDP. The launch of Datadog IDP coincided with the company's announcements in areas including AI observability, applied AI, AI security, and log management.