Details
-
Improvement
-
Resolution: Unresolved
-
Major
-
None
Description
There are two improvements we wish to make when gathering stats as part of cbcollect:
- If we cannot connect to the server on port 11209, then retry on port 11210. Motivated by CBSE-5616, not the root cause but we saw that memcached was not listening on port 11210 so the inverse is plausible.
- If stats tasks fail then retry them up to 5 times (an arbitrary amount). Motivated by CBSE-5659, in which one of the worker threads hung. Stats tasks can be assigned to any worker thread so in the case of this CBSE many stats were missing as they were assigned to the hung worker. If a stats task timeouts then we should retry it in the hope that it is served by a different worker thread.