Description
From stack trace of MB-48884, I noticed the following:
goroutine 14831357 [semacquire, 658 minutes]:
|
sync.runtime_Semacquire(0xc06ae50238)
|
/home/couchbase/.cbdepscache/exploded/x86_64/go-1.16.5/go/src/runtime/sema.go:56 +0x45
|
sync.(*WaitGroup).Wait(0xc06ae50230)
|
/home/couchbase/.cbdepscache/exploded/x86_64/go-1.16.5/go/src/sync/waitgroup.go:130 +0x65
|
github.com/couchbase/goxdcr/pipeline_svc.(*CheckpointManager).Stop(0xc0c7d382c0, 0xc0000b2570, 0x30)
|
/home/couchbase/jenkins/workspace/couchbase-server-unix/goproj/src/github.com/couchbase/goxdcr/pipeline_svc/checkpoint_manager.go:710 +0x5a
|
github.com/couchbase/goxdcr/pipeline_ctx.(*PipelineRuntimeCtx).stopService(0xc0ad581640, 0xfed4bd, 0x11, 0x11337e8, 0xc0c7d382c0, 0xc0dbcd2610, 0xc09ede7da0)
|
/home/couchbase/jenkins/workspace/couchbase-server-unix/goproj/src/github.com/couchbase/goxdcr/pipeline_ctx/pipeline_runtimeCtx.go:201 +0xa4
|
created by github.com/couchbase/goxdcr/pipeline_ctx.(*PipelineRuntimeCtx).Stop.func2
|
/home/couchbase/jenkins/workspace/couchbase-server-unix/goproj/src/github.com/couchbase/goxdcr/pipeline_ctx/pipeline_runtimeCtx.go:167 +0x32c
|
CkptMgr's waitgroup.Done() isn't being called in all conditions and can lead to the hang, which can lead to pipeline runtime context stop timeouts
Attachments
Issue Links
- relates to
-
MB-48884 [System Test] Few cluster operations failed due to error - Internal server error
- Closed