Details
-
Bug
-
Resolution: Fixed
-
Major
-
7.6.2
-
Untriaged
-
0
-
Unknown
-
Analytics Sprint 42
Description
This issue takes place when a job is cancelled, and the cancelled counter is incremented (correctly as expected), then the JobExecutor issues cleanup tasks for this cancelled job with a status of FAILED, leading the execution of these cleanup tasks to also increment the failed job counter (due to the FAILED status).
We need to keep track of what job id has been stat-collected already, and not update the stats anymore on any follow up tasks for that specific job id.
Attachments
Issue Links
- relates to
-
MB-59430 [Observability] [Analytics Service] SRE team requested new metrics for better monitoring
- Closed
- links to