Resolved -
Today from 20:15-21:30 UTC a small sub-set of customers in the prod-us-east-0 region could see missing recording rule samples from this time period. This incident is currently resolved.
Dec 11, 20:00 UTC
Resolved -
The incident has been resolved. The cause was one database server being under heavy CPU load - that caused database queries to either take a long time to complete or fail altogether for Grafana Cloud instances using that specific database server. As an outcome, it made some instances unavailable, and it also made new instance startups fail.
Dec 11, 11:49 UTC
Monitoring -
A fix has been implemented and we are monitoring the results.
Dec 11, 11:02 UTC
Identified -
The issue has been identified and the fix is being implemented. Majority of stacks should be accessible again.
Dec 11, 10:48 UTC
Investigating -
We are currently investigating an issue impacting the availability of Grafana Cloud instances in the prod-eu-west-2 region.
Dec 11, 10:28 UTC
Resolved -
This incident has been resolved.
Dec 11, 01:42 UTC
Monitoring -
A fix has been implemented, and we are monitoring the results.
Dec 10, 23:07 UTC
Investigating -
At approximately 22:00 UTC our team was alerted to an issue causing elevated errors in some tenants in the prod-us-east-0 region. Users in the impacted region may experience errors while attempting to query metrics. We are actively taking steps to mitigate and resolve this issue.
Dec 10, 22:27 UTC