Resolved -
Today from 20:15-21:30 UTC a small sub-set of customers in the prod-us-east-0 region could see missing recording rule samples from this time period. This incident is currently resolved.
Dec 11, 20:00 UTC
Resolved -
The incident has been resolved. The cause was one database server being under heavy CPU load - that caused database queries to either take a long time to complete or fail altogether for Grafana Cloud instances using that specific database server. As an outcome, it made some instances unavailable, and it also made new instance startups fail.
Dec 11, 11:49 UTC
Monitoring -
A fix has been implemented and we are monitoring the results.
Dec 11, 11:02 UTC
Identified -
The issue has been identified and the fix is being implemented. Majority of stacks should be accessible again.
Dec 11, 10:48 UTC
Investigating -
We are currently investigating an issue impacting the availability of Grafana Cloud instances in the prod-eu-west-2 region.
Dec 11, 10:28 UTC
Resolved -
This incident has been resolved.
Dec 11, 01:42 UTC
Monitoring -
A fix has been implemented, and we are monitoring the results.
Dec 10, 23:07 UTC
Investigating -
At approximately 22:00 UTC our team was alerted to an issue causing elevated errors in some tenants in the prod-us-east-0 region. Users in the impacted region may experience errors while attempting to query metrics. We are actively taking steps to mitigate and resolve this issue.
Dec 10, 22:27 UTC
Resolved -
A fix has been implemented and we are monitoring the results. All affected instances should now be online and functioning normally.
Dec 4, 21:54 UTC
Update -
We are continuing to monitor for any further issues.
Dec 4, 18:35 UTC
Monitoring -
A fix has been implemented and we are monitoring the results. All affected instances should now be online and functioning normally.
Dec 4, 18:33 UTC
Identified -
Issue has been identified and a fix is being implemented.
Dec 4, 18:32 UTC
Investigating -
We are currently investigating an issue impacting the availability of hosted Grafana instances in the prod-eu-west-2 region.
Dec 4, 18:24 UTC
Resolved -
A fix has been implemented and we are monitoring the results. All affected instances should now be online and functioning normally.
Dec 4, 15:05 UTC
Monitoring -
A fix has been implemented and we are monitoring the results. All affected instances should now be online and functioning normally.
Dec 4, 15:03 UTC
Identified -
Issue has been identified and a fix is being implemented.
Dec 4, 15:00 UTC
Investigating -
We are currently investigating an issue impacting the availability of hosted Grafana instances in the prod-eu-west-2 region.
Dec 4, 14:50 UTC
Resolved -
This incident has been resolved.
Dec 4, 12:40 UTC
Monitoring -
A fix has been implemented and we are monitoring the results. All affected instances should now be online and functioning normally.
Dec 4, 11:56 UTC
Identified -
The issue has been identified and a fix is being implemented.
Dec 4, 11:10 UTC
Investigating -
We are currently investigating an issue impacting the availability of hosted Grafana instances in the prod-eu-west-2 region. We are actively working to identify the root cause and restore full service.
Dec 4, 10:08 UTC
Resolved -
This incident has been resolved.
Dec 3, 00:30 UTC
Investigating -
We’re currently experiencing degraded user experience with our Slack app (acknowledging, resolving, etc alert groups from Slack)
Dec 2, 23:43 UTC