Incident
An incident is a recorded event representing a detected problem with a service, from the moment it is confirmed until it is resolved.
An incident bundles everything about one outage: when it started, its type and severity, the evidence (failed checks, error messages, screenshots), and when it recovered. It is the unit that alerting, reporting, and status pages are built on.
Well-designed monitoring confirms an incident from multiple signals before opening it — avoiding false positives — and closes it automatically once checks pass again.
Related terms
MTTR (Mean Time To Recovery)MTTR (Mean Time To Recovery) is the average time it takes to restore a service after a failure begins.Status PageA status page is a public page that shows the current health of your services and a history of recent incidents and maintenance.False Positive (Alerting)A false positive is an alert that reports a problem which is not actually affecting users — a false alarm.Escalation PolicyAn escalation policy defines who gets notified about an incident, in what order, and how an unacknowledged alert escalates to the next responder.
Start monitoring in minutes
EU-hosted uptime monitoring with multi-location confirmation that kills false alarms — white-label for agencies.