Details
- Bug
- Resolution: Fixed
- Critical
- 7.1.4, 7.2.0
- Untriaged
- 0
- Unknown
- Analytics Sprint 21
Description
Observed in MB-57601 logs.
In a recent system test run, CBAS logged a WARN on every Prometheus fetch whose result was larger than 4K. There are over 9900 instances of this warning in the logs.
2023-06-23T03:50:37.288-07:00 WARN CBAS.server.ChunkedResponse [HttpExecutor(port:8095)-0] Error after header write of chunked response
2023-06-23T03:50:47.288-07:00 WARN CBAS.server.ChunkedResponse [HttpExecutor(port:8095)-3] Error after header write of chunked response
2023-06-23T03:50:57.288-07:00 WARN CBAS.server.ChunkedResponse [HttpExecutor(port:8095)-6] Error after header write of chunked response
2023-06-23T03:51:07.288-07:00 WARN CBAS.server.ChunkedResponse [HttpExecutor(port:8095)-5] Error after header write of chunked response
2023-06-23T03:51:17.288-07:00 WARN CBAS.server.ChunkedResponse [HttpExecutor(port:8095)-8] Error after header write of chunked response
2023-06-23T03:51:27.288-07:00 WARN CBAS.server.ChunkedResponse [HttpExecutor(port:8095)-9] Error after header write of chunked response
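For anyone wanting to reproduce the warning count from their own logs, a minimal Java sketch follows. It only matches the logger name and message text seen in the excerpt above; the class name, method name, and the default file name `analytics_warn.log` are hypothetical and not taken from the product.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

public class ChunkedWarnCounter {
    // Message text copied from the log excerpt in this ticket.
    static final String MARKER = "Error after header write of chunked response";

    /** Counts WARN lines from CBAS.server.ChunkedResponse that carry the marker text. */
    static long countWarnings(List<String> lines) {
        return lines.stream()
                .filter(l -> l.contains(" WARN CBAS.server.ChunkedResponse "))
                .filter(l -> l.contains(MARKER))
                .count();
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical default path; pass the real log file as the first argument.
        Path log = Path.of(args.length > 0 ? args[0] : "analytics_warn.log");
        System.out.println(countWarnings(Files.readAllLines(log)));
    }
}
```

Since the timestamps above are 10 seconds apart, the count should grow by roughly six per minute for as long as the problem persists.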
The response also returns a 500 status to ns_server. I do not know whether ns_server still scrapes an otherwise well-formed result that carries a 500 status; if it does not, the issue is more severe than the WARN spam in the log.
EDIT: I can confirm from promtimer that all Analytics stats are missing when the 500 is returned to ns_server, so this is a pretty severe issue.
Issue | Resolution |
When the Prometheus stats returned from Analytics exceeded 4 KiB, the status code was inadvertently set to 500 (Internal Server Error), and this resulted in a large number of warnings in the Analytics warning log. Couchbase Server discarded these statistics. | Analytics now returns a 200 (OK) status code when the size of the Prometheus stats exceeds 4 KiB, so the stats are recorded properly and the warning is no longer logged. |
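A rough way to check the behaviour described in the resolution is to fetch the stats endpoint and confirm that a response larger than 4 KiB comes back with a 200. The Java sketch below is only an illustration: the class and method names are hypothetical, the endpoint URL is not given in this ticket and must be supplied by the caller, and any authentication a real node requires is not handled.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class StatsProbe {
    // 4 KiB threshold taken from the issue description.
    static final int THRESHOLD_BYTES = 4096;

    /** True when the body is large enough to have hit the bug and the status is still 200. */
    static boolean largeResponseOk(int status, int bodyBytes) {
        return bodyBytes > THRESHOLD_BYTES && status == 200;
    }

    public static void main(String[] args) throws Exception {
        // args[0]: full URL of the stats endpoint (an assumption; not stated in this ticket).
        HttpRequest req = HttpRequest.newBuilder(URI.create(args[0])).GET().build();
        HttpResponse<byte[]> resp = HttpClient.newHttpClient()
                .send(req, HttpResponse.BodyHandlers.ofByteArray());
        int size = resp.body().length;
        System.out.println("status=" + resp.statusCode() + " bytes=" + size
                + " largeResponseOk=" + largeResponseOk(resp.statusCode(), size));
    }
}
```

A body of 4 KiB or less makes `largeResponseOk` return false regardless of status, because such a response would not have exercised the failing path in the first place.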
Issue Links
- relates to MB-57601 [System Test] : ERRO CBAS.util.ResourceLeakDetector [nioEventLoopGroup-5-10] LEAK: ByteBuf.release() was not called before it's garbage-collected. See https://netty.io/wiki/reference-counted-objects.html for more information. (Closed)