Resolved -
This incident has been resolved.
Mar 23, 08:13 UTC
Update -
We are currently replaying the pending tasks and monitoring system stability.
Mar 20, 15:56 UTC
Monitoring -
Pod scheduling and task execution have resumed following administrative actions on the cluster. The situation is currently being monitored.
Mar 20, 09:05 UTC
Identified -
We are currently experiencing an incident affecting task execution due to a limitation in the cluster responsible for coordination and scheduling. This issue started yesterday around 18:26 and has prevented new processing tasks from starting.
The engineering team has identified the root cause to mitigate the issue. Investigation and remediation efforts are ongoing.
Mar 20, 08:25 UTC