This incident has been resolved.
May 26, 20:59 UTC
We are still working on a resolution, but most clusters' metrics have been completely restored and are up to date. About 12% of clusters are still backfilling historical metrics since the start of the incident. A small number of customers will have a small gap in their metrics data.
May 26, 20:38 UTC
We have deployed an initial fix and have some data for affected clusters currently backfilling.
May 26, 18:02 UTC
A fix has been implemented and we are monitoring the results.
May 26, 15:42 UTC
The issue has been identified and a fix is being implemented.
May 26, 14:39 UTC