Loading...

XML

Word

Printable

Details

Type: Bug
Resolution: Fixed
Priority: Critical
Fix Version/s: 6.6.0
Affects Version/s: 6.6.0
Component/s: tools
Labels:
- AutomatedTests
- approved-for-6.6.0

Triage:
Untriaged
Story Points:
1
Is this a Regression?:
Yes

Description

What's the issue?
We are running a test case where we start cbbackupmgr running a backup then asynchronously kill erlang, at which point we wait for cbbackupmgr to timeout. If killed at the right/wrong time then cbbackupmgr will fail to correctly teardown the connection to the cluster.

What's the root cause of the issue?
After looking at the stack trace of the running process/debug logs we can see that gocbcore is receiving an EOF when trying to read from the connect to the cluster. This EOF error doesn't appear to be returned to cbbackupmgr causing a wait group not to be decremented (hence the hang).

If we look at the attached logs we can see that the stream observer 'End' function is not being run with the received EOF error. Gocbcore then continues (in a loop) to indefinitely recreate the connection to the server (using CCCPPOLL).

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending
- Thumbnails
- List
- Download All

backup-0.log
27 kB
23/Jun/20 10:44 AM
backup-with-debug-logging-0.log
190 kB
23/Jun/20 10:44 AM
backup-with-debug-logging-1.log
97 kB
23/Jun/20 10:44 AM

Issue Links

is caused by

GOCBC-929 gocbcore v9 DCP is failing to propagate EOF/socket read failure errors up to the application

Resolved

Gerrit Reviews

- Issue Only
- Show All Reviews
- Show Open Reviews
- Show All Issues
- Show Open Issues

No reviews matched the request. Check your Options in the drop-down menu of this sections header.

Activity

People

Assignee:: James Lee

Reporter:: James Lee

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 23/Jun/20 10:44 AM

Updated:: 25/Jun/20 11:42 AM

Resolved:: 25/Jun/20 11:42 AM

Gerrit Reviews

There are no open Gerrit changes

Show There is 1 closed Gerrit change

Hide There is 1 closed Gerrit change

MB-40107 Update gocbcore to pick up error propagation patch: Gerrit Review:

cbbackupmgr may hang when shutting down the connection to the cluster

Details

Description

Attachments

Attachments

Issue Links

Gerrit Reviews

Activity

People

Dates

Gerrit Reviews

PagerDuty