Details
-
Bug
-
Resolution: Duplicate
-
Test Blocker
-
None
-
None
-
None
-
0
Description
The KV performance against 7.6 when the cluster undergoes rebalances seems to be quite poor compared to previous server versions and often comes with a number of timed out operations.
Here we're using FIT-SIT against Capella 7.6, doing a 3 -> 5 node scale:
You can see a period of 1.5-2mins where the throughput has dropped a lot - basically to 0. And there are 72 KV errors in this time (the performer needs an improvement to record the actual time these error happen, hence them not showing on the graph).
Compared to Capella 7.2 with the same test:
No errors, and very little throughput disturbance.
This seems to happen for any scale up or down, and we see similar things for 7.6 on SDKD, though the performance does generally seem to be better with sdkd.
Note that in all test cases the SDK does eventually re-establish the same throughput as before the cluster change.