Description
Looking at the logs from MB-48672, we see an extraordinary number of async listeners goroutines that did not stop properly:
goroutine profile: total 8789
|
4278 @ 0x43c9a5 0x44c88f 0x94e2b1 0x471981
|
# 0x94e2b0 github.com/couchbase/goxdcr/component.(*AsyncComponentEventListenerImpl).start+0x250 /home/couchbase/jenkins/workspace/couchbase-server-unix/goproj/src/github.com/couchbase/goxdcr/component/async_listener.go:68
|
|
628 @ 0x43c9a5 0x44c88f 0x751ab4 0x471981
|
# 0x751ab3 github.com/couchbase/gomemcached/client.(*UprFeed).sendCommands+0xf3 /home/couchbase/jenkins/workspace/couchbase-server-unix/goproj/src/github.com/couchbase/gomemcached/client/upr_feed.go:340
|
|
433 @ 0x43c9a5 0x44c88f 0x98057f 0x471981
|
# 0x98057e github.com/couchbase/goxdcr/parts.(*XmemNozzle).processData_sendbatch+0x1fe /home/couchbase/jenkins/workspace/couchbase-server-unix/goproj/src/github.com/couchbase/goxdcr/parts/xmem_nozzle.go:1018
|
|
433 @ 0x43c9a5 0x44c88f 0x990736 0x471981
|
# 0x990735 github.com/couchbase/goxdcr/parts.(*XmemNozzle).receiveResponse+0xf5 /home/couchbase/jenkins/workspace/couchbase-server-unix/goproj/src/github.com/couchbase/goxdcr/parts/xmem_nozzle.go:2577
|
|
433 @ 0x43c9a5 0x44c88f 0x993650 0x471981
|
# 0x99364f github.com/couchbase/goxdcr/parts.(*XmemNozzle).selfMonitor+0x20f /home/couchbase/jenkins/workspace/couchbase-server-unix/goproj/src/github.com/couchbase/goxdcr/parts/xmem_nozzle.go:2860
|
|
431 @ 0x43c9a5 0x44c88f 0x99488c 0x471981
|
# 0x99488b github.com/couchbase/goxdcr/parts.(*XmemNozzle).checkAndRepairBufferMonitor+0x26b /home/couchbase/jenkins/workspace/couchbase-server-unix/goproj/src/github.com/couchbase/goxdcr/parts/xmem_nozzle.go:2933
|
Attachments
Issue Links
- relates to
-
MB-48728 [System Test] XDCR OOM killed multiple times
- Closed
-
MB-48772 [System Test] Rebalance exited with reason not_all_nodes_are_ready_yet
- Closed
-
MB-48787 XDCR - backfill pipeline not stopping all async listeners
- Closed
-
MB-48672 [System Test] batchGetMeta received fatal error and had to abort - error observed in longevity
- Closed
-
MB-48677 [System Test][XDCR] RuntimeCtx : Execution timed out - observed in longevity during topology changes
- Closed