All Systems Operational
FRA1 - XDR ? Operational
Ingestion Operational
Threat Intelligence for research & triage Operational
Automation Operational
Event storage Operational
Detection Operational
Hunting Operational
Case management Operational
Web application Operational
FRA1 - CTI ? Operational
Search Operational
API consumption Operational
TAXII consumption Operational
MISP consumption Operational
Enrichers Operational
Web application Operational
FRA2 - XDR (SecNumCloud / PCI DSS region) ? Operational
MCO1 - XDR ? Operational
EUR1 - XDR ? Operational
UAE1 - XDR ? Operational
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Past Incidents
Apr 17, 2024

No incidents reported today.

Apr 16, 2024

No incidents reported.

Apr 15, 2024

No incidents reported.

Apr 14, 2024

No incidents reported.

Apr 13, 2024

No incidents reported.

Apr 12, 2024

No incidents reported.

Apr 11, 2024

No incidents reported.

Apr 10, 2024

No incidents reported.

Apr 9, 2024

No incidents reported.

Apr 8, 2024

No incidents reported.

Apr 7, 2024

No incidents reported.

Apr 6, 2024

No incidents reported.

Apr 5, 2024

No incidents reported.

Apr 4, 2024

No incidents reported.

Apr 3, 2024
Resolved - The situation is back to normal even though we are missing incident updates from our cloud provider. All machines are now online and event processing is back in real-time. Our external TCP and HTTP probes are not returning any errors, as well as our APIs.
Apr 3, 03:38 CEST
Monitoring - The web application, API and ingestion endpoints have been fully accessible for a few minutes now. We are still monitoring the situation, as some processing backlog was bufferized internally while the VM were offline.
Apr 3, 02:59 CEST
Identified - The issue has been confirmed by our cloud provider. A generalized incident is ongoing on their Gravelines datacenter. For more info from their side, see https://public-cloud.status-ovhcloud.com/incidents/897ngd9y00sq

It seems that our hosts are gradually coming back online. We are monitoring the recovery of the platform.

Error rate on API and event ingestion is currently going down.

Apr 3, 02:53 CEST
Investigating - Our monitoring system indicates that we lost connectivity to several virtual machines at the same time. This is most likely an issue caused by an incident at our cloud provider. We are investigating.

Events are still being processed but a number of API and intake endpoints are currently returning 50x errors.

Apr 3, 02:37 CEST