Tools for organizations to identify & respond to unplanned IT events
$36.00/month
Any event that may lead to a disruption in your organization’s services can be described as an incident, and it is often necessary to have a set framework for handling these events. Datadog's Incident Management feature offers a streamlined system for identifying and resolving incidents. With this tool, you can detect, prioritize, and monitor incidents seamlessly without switching contexts. Assess severity, involve the necessary teams and resources, and collaborate directly within the app or through your preferred communication platforms. Keep track of incidents on an intuitive timeline and generate postmortem reports.
Top Features
Declare, manage, and investigate incidents from multiple sources
Pivot from alert to chat room to timeline with no loss of context
Leverage a collaborative workflow with the Datadog Slack App
Set up webhooks from monitors and runbooks for autoremediation
Integrations with paging and communication tools
Collect data and signals from across the platform
Export to Datadog Notebooks and other documentation tools
Create, investigate, and resolve incidents from triggered alerts and security signals
- Manage incidents from detection to resolution within Datadog’s web and mobile apps and the Datadog Slack app, with no context switching.
- Leverage automatic integrations with the tools you already use.
- Simplify your incident management process with automated out-of-the-box workflows.

Build a collaborative and dynamic response team
- Bring in relevant people and teams instantly with tagging, roles, and real-time collaboration.
- Utilize incident-specific communication streams while working with stakeholders throughout an investigation.
- Work together on interactive timelines that capture all incident-related context.

Preserve data from incident declaration to resolution with Datadog Postmortem Notebooks
- Automatically generate postmortems with incident data, or export data to the tool of your choice.
- Track progress across the response team with real-time collaboration.
- Include graphs from any data source and scope them to the exact time of impact.

Improve your response with monitoring data from across your stack
- Dive deeper into logs, traces, network, infrastructure, and more to find the root cause.
- Validate issues and outages from either your desktop or the Datadog mobile app without losing context.
- Pull related graphs, visualizations, and other pieces of evidence into the timeline for real-time monitoring throughout the incident.

Additional Information
Terms & Conditions
Terms of Service
https://www.datadoghq.com/legal/terms/
Privacy Policy
https://www.datadoghq.com/legal/privacy/
Resources
Datadog Incident Management - About
In this session, Technical Evangelist Ara Pulido chats with Léo Cavaillé, SRE Manager, and Matt Hardwick, an engineer working on Datadog’s incident application, to discuss how incident management evolved at Datadog.