Couchbase Server / MB-31352

xdcr replication hang


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 5.5.2
    • Fix Version/s: 6.5.0
    • Component/s: XDCR
    • Labels: None
    • Triage: Triaged
    • Is this a Regression?: No

    Description

      I was trying to set up XDCR replication from an in-house cluster to EC2. The following things happened:

      1. Set up XDCR from a 4-node (2 data nodes) in-house cluster to a 16-node EC2 cluster (8 data nodes) for 1 bucket (msm).

      2. I didn't provide hostnames for the EC2 nodes initially. That caused the UI to take a long time to respond about what was happening. After it came back, I tried to delete the replication. There was initially no response, then an error was reported on the UI, and multiple attempts to delete the replication were unsuccessful.

      3. I restarted the goxdcr process on one data node in the source cluster (172.23.97.37). This cleaned up the XDCR replication from the UI.

      Restarted at:
      ns_1@172.23.97.37 6:43:52 PM Tue Sep 18, 2018

      4. I fixed the hostnames on the EC2 cluster and set up the XDCR replication on the source cluster again. This time it started replicating.

      5. After replicating 50% of the data, progress stopped. I then killed the goxdcr process on the 2nd data node in the source cluster, which kicked off the replication of the remaining 50% of the data.

      Message in logs before restart:

      2018-09-19T04:09:59.411-07:00 INFO GOXDCR.PipelineMgr: Replication Status = map[a8da6785a5cce7dc20c1f861ba93a500/msm/msm:name={a8da6785a5cce7dc20c1f861ba93a500/msm/msm}, status={Pending}, errors={[]}, progress={Pipeline has been stopped}
      

      Restarted at:
      ns_1@172.23.97.38 12:19:26 PM Wed Sep 19, 2018

      Source Cluster Logs:
      https://s3.amazonaws.com/cb-customers/deepkaran/collectinfo-2018-09-19T192037-ns_1%40172.23.97.37.zip
      https://s3.amazonaws.com/cb-customers/deepkaran/collectinfo-2018-09-19T192037-ns_1%40172.23.97.38.zip
      https://s3.amazonaws.com/cb-customers/deepkaran/collectinfo-2018-09-19T192037-ns_1%40172.23.97.39.zip
      https://s3.amazonaws.com/cb-customers/deepkaran/collectinfo-2018-09-19T192037-ns_1%40172.23.97.40.zip

      Let me know if you need destination cluster logs as well.

      Attachments

        Issue Links


          Activity

            neil.huang Neil Huang added a comment -

            I'm getting "{{AccessDenied}}Access DeniedF62C464CE3F3ABF9PcJ22hG6sVVqaT2dguVscJ8LyUI65eVEs+meBfOqF6JCexFd7oa6a5H+BcgqRvDlAIDbe/FpotQ=" error when trying to download log files.


            raju Raju Suravarjjala added a comment -

            Neil Huang please let me know if this bug needs to be fixed in Alice.
            neil.huang Neil Huang added a comment - - edited

            The first time the replication to the EC2 cluster was created, we see a lot of log entries with network i/o timeout errors. But first, the replication was able to pass the pre-requisite checks and was created:

            On Node37

            2018-09-18T18:31:20.446-07:00 INFO GOXDCR.XDCRFactory: Target topology retrieved. kvVBMap = map[172.31.5.142:11210:[5… 546… 639] 172.31.5.182:11210:[…
            

            Once it got created, though, we started seeing errors contacting the host.

            2018-09-18T18:31:20.156-07:00 ERRO GOXDCR.RemClusterSvc: Failed to refresh remote cluster reference remoteCluster/b8l-T9bSsLzGJhWCfQhVVgnewRpDBaTgoP4RYFbGeDc= since none of the nodes in target node list is accessible. node list = [172.31.9.28:8091 172.31.5.142:8091 172.31.13.49:8091 172.31.5.212:8091 172.31.5.182:8091 172.31.0.239:8091 172.31.15.98:8091 172.31.15.42:8091]
            

            2018-09-18T18:36:23.408-07:00 WARN GOXDCR.Utils: ExponentialBackoffExecutor for GetRemoteMemcachedConnection encountered error (dial tcp 172.31.5.142:11210: i/o timeout). Sleeping 400ms
            

            We also see that GoXDCR backs off, and the wait between tries can last a while:

            2018-09-18T18:35:19.807-07:00 WARN GOXDCR.CheckpointMgr: Failed to construct memcached client for 172.31.5.142:11210, err=dial tcp 172.31.5.142:11210: i/o timeout
            2018-09-18T18:35:19.807-07:00 WARN GOXDCR.Utils: ExponentialBackoffExecutor for GetRemoteMemcachedConnection encountered error (dial tcp 172.31.5.142:11210: i/o timeout). Sleeping 3.2s
            

            XDCR was force-restarted at:

            2018/09/18 18:37:56 child process exited with status 143
            

            Then a replication deletion:

            2018-09-18T18:38:25.617-07:00 INFO GOXDCR.AdminPort: doDeleteReplicationRequest
            2018-09-18T18:38:25.617-07:00 INFO GOXDCR.AdminPort: Request params: replicationId=a8da6785a5cce7dc20c1f861ba93a500/msm/msm
            2018-09-18T18:38:25.623-07:00 INFO GOXDCR.ReplMgr: Deleting replication a8da6785a5cce7dc20c1f861ba93a500/msm/msm
            

            Followed by RC deletion:

            2018-09-18T18:38:58.175-07:00 INFO GOXDCR.AdminPort: doDeleteRemoteClusterRequest
            2018-09-18T18:38:58.186-07:00 INFO GOXDCR.RemoteClusterChangeListener: metakvCallback called on listener RemoteClusterChangeListener with path = /remoteCluster/b8l-T9bSsLzGJhWCfQhVVgnewRpDBaTgoP4RYFbGeDc=
            2018-09-18T18:38:58.186-07:00 INFO GOXDCR.RemClusterSvc: metakvCallback called on path = /remoteCluster/b8l-T9bSsLzGJhWCfQhVVgnewRpDBaTgoP4RYFbGeDc=
            2018-09-18T18:38:58.186-07:00 INFO GOXDCR.RemClusterSvc: Remote cluster Appd_deep deleted from metadata store
            

            Then XDCR on node 37 was killed again before recreating the remote cluster as well as the replication:

            [goport(/opt/couchbase/bin/goxdcr)] 2018/09/18 18:43:52 child process exited with status 143
            2018-09-18T18:44:37.666-07:00 INFO GOXDCR.AdminPort: doCreateRemoteClusterRequest req=&{POST /pools/default/remoteClusters …
            2018-09-18T18:46:10.589-07:00 INFO GOXDCR.PipelineMgr: Starting the pipeline a8da6785a5cce7dc20c1f861ba93a500/msm/msm
            2018-09-18T18:46:10.589-07:00 INFO GOXDCR.PipelineMgr: Validating pipeline a8da6785a5cce7dc20c1f861ba93a500/msm/msm
            2018-09-18T18:46:10.591-07:00 INFO GOXDCR.ReplMgr: Success adding replication specification a8da6785a5cce7dc20c1f861ba93a500/msm/msm
            2018-09-18T18:46:10.591-07:00 INFO GOXDCR.ReplMgr: Replication specification a8da6785a5cce7dc20c1f861ba93a500/msm/msm is created
            2018-09-18T18:46:10.591-07:00 INFO GOXDCR.AdminPort: Finished doCreateReplicationRequest call
            

            Then a delete:

            2018-09-18T18:46:47.431-07:00 INFO GOXDCR.AdminPort: Finished doDeleteReplicationRequest
            

            Then a create that failed because the EC2 cluster could not be contacted for bucket validation.

            2018-09-18T18:54:35.541-07:00 INFO GOXDCR.AdminPort: doCreateReplicationRequest
            2018-09-18T18:54:35.543-07:00 INFO GOXDCR.AdminPort: Request parameters: justValidate=false, fromBucket=msm, toCluster=deep_aws, toBucket=msm, settings=map[optimistic_replication_threshold:256 source_nozzle_per_node:2 log_level:Info checkpoint_interval:600 replication_type:xmem stats_interval:1000 worker_batch_size:500 failure_restart_interval:10 target_nozzle_per_node:2 doc_batch_size_kb:2048 bandwidth_limit:0 compression_type:3]
            2018-09-18T18:54:39.374-07:00 ERRO GOXDCR.AdminPort: Error creating replication. errorsMap=map[toBucket:Error validating target bucket 'msm'. err=Operation failed after max retries.  Last error: Bucket doesn't exist]
            

            Then finally, a create that works:

            2018-09-18T18:55:02.288-07:00 INFO GOXDCR.AdminPort: doCreateReplicationRequest
            2018-09-18T18:55:02.290-07:00 INFO GOXDCR.AdminPort: Request parameters: justValidate=false, fromBucket=msm, toCluster=deep_aws, toBucket=msm, settings=map[doc_batch_size_kb:2048 stats_interval:1000 optimistic_replication_threshold:256 source_nozzle_per_node:2 log_level:Info worker_batch_size:500 replication_type:xmem checkpoint_interval:600 failure_restart_interval:10 compression_type:3 target_nozzle_per_node:2 bandwidth_limit:0]
            2018-09-18T18:55:02.466-07:00 INFO GOXDCR.ReplSpecSvc: Successfully retrieved target cluster reference 0x803d30. time taken=158.493889ms
            2018-09-18T18:55:02.468-07:00 INFO GOXDCR.ReplSpecSvc: Validated that source bucket and target bucket are not the same. time taken=2.02316ms
            2018-09-18T18:55:02.563-07:00 INFO GOXDCR.ReplSpecSvc: Result from remote bucket look up: connStr=ec2-54-89-185-115.compute-1.amazonaws.com:8091, bucketName=msm, targetBucketType=membase, err_target=<nil>, time taken=94.566424ms
            2018-09-18T18:55:02.573-07:00 INFO GOXDCR.ReplSpecSvc: GetMemcachedConnection serverAddr=172.23.97.37:11210, bucketName=msm
            2018-09-18T18:55:05.705-07:00 INFO GOXDCR.AdminPort: Finished doCreateReplicationRequest call
            2018-09-18T18:55:11.271-07:00 INFO GOXDCR.PipelineMgr: Pipeline a8da6785a5cce7dc20c1f861ba93a500/msm/msm has been updated successfully
            

            Everything looks fine from this point on, with none of those TCP dial i/o errors. The interesting thing is that the last XDCR restart didn’t “fix” the dial error; it went away on its own (see the previous create that failed at 18:54). The dial error didn’t show up again until the next day:

            2018-09-19T17:00:22.929-07:00 WARN GOXDCR.RemClusterSvc: When refreshing remote cluster reference remoteCluster/Lfhydk6_fIgdxTHlw0EWcNLuZ2Kw5ZYw6BzDOujMKXE=, skipping node ec2-204-236-202-221.compute-1.amazonaws.com:8091 because of error retrieving default pool info from target. err=Failed on calling host=ec2-204-236-202-221.compute-1.amazonaws.com:8091, path=/pools/default, err=Get http://ec2-204-236-202-221.compute-1.amazonaws.com:8091/pools/default: dial tcp 204.236.202.221:8091: i/o timeout, statusCode=0
            

            In the meantime, from the create that works, let’s take a look at the other data node 38:

            The create from node 37 is seen on 38:

            2018-09-18T18:55:05.709-07:00 INFO GOXDCR.ReplicationSpecChangeListener: metakvCallback called on listener ReplicationSpecChangeListener with path = /replicationSpec/a8da6785a5cce7dc20c1f861ba93a500/msm/msm
            2018-09-18T18:55:05.709-07:00 INFO GOXDCR.ReplSpecSvc: ReplicationSpecServiceCallback called on path = /replicationSpec/a8da6785a5cce7dc20c1f861ba93a500/msm/msm
            2018-09-18T18:55:05.709-07:00 INFO GOXDCR.ReplicationSpecChangeListener: specChangedCallback called on id = a8da6785a5cce7dc20c1f861ba93a500/msm/msm, oldSpec=<nil>, newSpec=&{a8da6785a5cce7dc20c1f861ba93a500/msm/msm MdMjrWfLwqNUMAazTpC0XQ== msm ac272b0f8cccb858dbab37ba6a52666c a8da6785a5cce7dc20c1f861ba93a500 msm fcf5e9438f10a2734aeac4da89a0d5bc 0xc42001f220 [131 108 0 0 0 1 104 2 109 0 0 0 32 50 51 57 52 97 51 51 101 49 54 101 52 48 52 97 51 101 53 50 101 56 57 97 48 57 102 57 101 101 101 97 102 104 2 97 6 110 5 0 121 40 22 213 14 106]}
            2018-09-18T18:55:05.709-07:00 INFO GOXDCR.ReplicationSpecChangeListener: new spec settings=&{xmem  true 600 500 2048 10 256 2 2 0 0 Info 1000 0 3 <nil>}
            2018-09-18T18:55:05.709-07:00 INFO GOXDCR.ReplicationSpecChangeListener: Starting pipeline a8da6785a5cce7dc20c1f861ba93a500/msm/msm since the replication spec has been changed to active
            

            But from the pipeline manager message, it seems that the pipeline was not starting, as it is still in a paused state, with the last error from 18:33.

            2018-09-18T18:55:44.407-07:00 INFO GOXDCR.PipelineMgr: Replication Status = map[a8da6785a5cce7dc20c1f861ba93a500/msm/msm:name={a8da6785a5cce7dc20c1f861ba93a500/msm/msm}, status={Pending}, errors={[{"time":"2018-09-18T18:33:26.788282457-07:00","errMsg":"dial tcp 172.31.5.142:11210: i/o timeout"}]}, progress={Pipeline failed to start, err=map[genericPipeline.context.Start:dial tcp 172.31.5.142:11210: i/o timeout]}
            ]
            

            Looking backwards, the last message of note was:

            2018-09-18T18:46:20.394-07:00 INFO GOXDCR.ReplicationSpecChangeListener: Stopping pipeline a8da6785a5cce7dc20c1f861ba93a500/msm/msm since the replication spec has been changed to inactive
            

            And we don’t see any action taken by the pipeline manager. Looking further back, the last visible action taken was:

            2018-09-18T18:33:36.789-07:00 INFO GOXDCR.PipelineMgr: Try to stop pipeline a8da6785a5cce7dc20c1f861ba93a500/msm/msm
            2018-09-18T18:33:36.789-07:00 INFO GOXDCR.PipelineMgr: Trying to stop the pipeline a8da6785a5cce7dc20c1f861ba93a500/msm/msm
            2018-09-18T18:33:36.789-07:00 INFO GOXDCR.GenericPipeline: Stopping pipeline a8da6785a5cce7dc20c1f861ba93a500/msm/msm-453901974
            2018-09-18T18:33:36.789-07:00 INFO GOXDCR.CheckpointMgr: Starting checkpointing for pipeline a8da6785a5cce7dc20c1f861ba93a500/msm/msm before stopping
            2018-09-18T18:33:36.790-07:00 INFO GOXDCR.CheckpointMgr: Start one time checkpointing for replication a8da6785a5cce7dc20c1f861ba93a500/msm/msm
            2018-09-18T18:36:36.789-07:00 INFO GOXDCR.CheckpointMgr: Checkpointing for pipeline a8da6785a5cce7dc20c1f861ba93a500/msm/msm timed out after 3m0s.
            

            The checkpoint timed out, and the pipeline then proceeded to stop the async listener services, which include the checkpoint manager itself. This is where the pipeline manager is held up, because we never see “Async Listener has stopped”.
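            To make the stop path concrete, here is a rough sketch in Go (hypothetical names, not the actual goxdcr pipeline code) of the sequence described above: the pipeline stop waits for each async listener service to confirm it has stopped, so a single listener blocked in its cleanup stalls the whole stop.

            // Hypothetical sketch: the pipeline stop waits for every async listener
            // service to finish before logging "Async Listener has stopped". If one
            // listener's Stop() blocks inside cleanup (here, the checkpoint manager
            // closing its clients), the pipeline stop never completes.
            package sketch

            import "log"

            type asyncListener interface {
                Stop() // blocks until the listener has fully shut down
            }

            func stopAsyncListenerServices(listeners []asyncListener) {
                for _, l := range listeners {
                    l.Stop() // never returns if this listener is stuck in cleanup
                    log.Println("Async Listener has stopped")
                }
            }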

            If anything, given that the checkpoint manager had problems with checkpointing before, it is most likely stuck closing the clients.
            https://github.com/couchbase/goxdcr/blob/e9ae947df57c37a65ee7776a50324e3981d748aa/pipeline_svc/checkpoint_manager.go#L392

            It is possible that a client had already been established and that, due to the intermittent errors, the checkpoint manager’s connection closer hangs, since it must depend on the client itself to finish closing the TCP connection. There is no concrete evidence of this, because XDCR doesn’t log at that level of detail, for the sake of logging brevity.
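            If the close can indeed block indefinitely on a bad connection, one defensive pattern (purely illustrative, not what goxdcr currently does) is to run the close in its own goroutine and stop waiting after a timeout, at the cost of leaking that goroutine:

            // Illustrative sketch only: bound how long we wait for a client Close()
            // that may never return, leaking the closing goroutine in the worst case.
            package sketch

            import (
                "fmt"
                "io"
                "time"
            )

            func closeWithTimeout(c io.Closer, timeout time.Duration) error {
                done := make(chan error, 1)
                go func() {
                    // May block forever if the underlying TCP teardown hangs.
                    done <- c.Close()
                }()
                select {
                case err := <-done:
                    return err
                case <-time.After(timeout):
                    return fmt.Errorf("close did not complete within %v", timeout)
                }
            }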

            yu Yu Sui (Inactive) added a comment - - edited

            The issue is that the checkpoint manager tried to save a checkpoint before the replication stopped. It repeatedly ran into issues saving the checkpoint, though, because the target was inaccessible. The checkpoint manager has a retry mechanism, a timeout, and a finChan. When the timeout was up, which happened at 2018-09-18T18:36:36, the checkpoint manager closed the finChan, hoping that this would get the checkpointing routine to abort right away. This did not happen, though, because the checkpointing routine was retrying at the target stats retrieval step, where the finChan was not yet effective.
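            To make this concrete, here is a minimal sketch in Go (hypothetical names, not the actual checkpoint manager code) of a retry loop that only sleeps between attempts; closing finChan elsewhere has no immediate effect on such a loop, because nothing in it ever reads from that channel:

            // Sketch of a retry loop that does not observe a finish channel. Closing
            // finChan (as the checkpoint manager did at 18:36:36) does not interrupt
            // it; it keeps retrying and sleeping until the attempts are exhausted.
            package sketch

            import "time"

            func retryIgnoringFinChan(op func() error, attempts int, wait time.Duration) error {
                var err error
                for i := 0; i < attempts; i++ {
                    if err = op(); err == nil {
                        return nil
                    }
                    time.Sleep(wait) // finChan is never consulted here
                    wait *= 2
                }
                return err
            }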

            Checkpoint manager finally gave up on retrying on 2018-09-18T18:46:20.

            2018-09-18T18:46:20.209-07:00 ERRO GOXDCR.CheckpointMgr: a8da6785a5cce7dc20c1f861ba93a500/msm/msm Retrieval of high seqno and vbuuid stats failed after 5 retries.

            At the same time, a manual pause and resume of the replication happened, so it is not clear whether the replication would have proceeded to stop successfully if left to its own devices. However, the replication did not start successfully after the manual resume, which is an issue we need to look into. Will do some more digging.

            As for the issue that the checkpoint manager took longer to abort than planned, there are two things we could do:

            1. Use ExponentialBackoffExecutorWithFinishSignal, which can make use of a passed-in finChan and abort quicker (see the sketch after this list).
            2. Let the checkpoint manager’s one-time-checkpoint function abort without waiting for the actual checkpointing routine to exit.
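            As a rough illustration of option 1 (a sketch with assumed names and signature, not necessarily goxdcr's actual ExponentialBackoffExecutorWithFinishSignal), the key change is to select on the finish channel while waiting, so that a closed finChan aborts the retry promptly:

            // Sketch of a backoff executor that honors a finish signal: the select
            // makes a closed finChan abort the wait immediately instead of only
            // after all retries have been exhausted.
            package sketch

            import (
                "errors"
                "time"
            )

            func retryWithFinishSignal(op func() error, attempts int, wait time.Duration, finChan chan bool) error {
                var err error
                for i := 0; i < attempts; i++ {
                    if err = op(); err == nil {
                        return nil
                    }
                    select {
                    case <-finChan:
                        return errors.New("aborted: finish signal received")
                    case <-time.After(wait):
                        wait *= 2 // exponential backoff between attempts
                    }
                }
                return err
            }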

            This may not be the root cause of the stuck issue, though.


            yu Yu Sui (Inactive) added a comment -

            After a second look at the log files, the checkpoint manager did get stuck after it gave up retrying at 2018-09-18T18:46:20. ckmgr.getHighSeqnoAndVBUuidFromTarget() prints high_seqno_and_vbuuid_map before it exits. high_seqno_and_vbuuid_map was never printed, though, indicating that getHighSeqnoAndVBUuidFromTarget() never completed.

            func (ckmgr *CheckpointManager) getHighSeqnoAndVBUuidFromTarget() map[uint16][]uint64 {
                ...
                for serverAddr, vbnos := range ckmgr.target_kv_vb_map {
                    ckmgr.getHighSeqnoAndVBUuidForServerWithRetry(serverAddr, vbnos, high_seqno_and_vbuuid_map)
                }
                ckmgr.logger.Infof("high_seqno_and_vbuuid_map=%v\n", high_seqno_and_vbuuid_map)
                ...

            The only possible scenario I can think of is that there were two nodes in the target cluster. After the checkpoint manager gave up retrying on one node, it moved on to call getHighSeqnoAndVBUuidForServerWithRetry() on the other node. That call never returned, causing the checkpoint manager to get stuck.

            Fix #2 in the previous comment, letting the checkpoint manager’s one-time-checkpoint function abort without waiting for the actual checkpointing routine to exit, would fix the hang in this scenario. It would still leave a stuck routine behind, though.
            In the longer term, we could look into golang Context to see if there are better ways to fix this without leaving any stuck routines.
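            For the longer-term idea, a hedged sketch of how the context package could bound the whole one-time checkpoint instead of relying on a finChan that is only checked at certain steps (the function names here are illustrative, not goxdcr's):

            // Illustrative sketch: wrap the one-time checkpoint in a context with a
            // deadline. The caller stops waiting once the context expires, even if
            // the checkpointing goroutine is still blocked on a target node, which
            // is the residual "stuck routine" mentioned above.
            package sketch

            import (
                "context"
                "time"
            )

            func checkpointOnceWithTimeout(ctx context.Context, doCheckpoint func() error, timeout time.Duration) error {
                ctx, cancel := context.WithTimeout(ctx, timeout)
                defer cancel()

                done := make(chan error, 1)
                go func() { done <- doCheckpoint() }()

                select {
                case err := <-done:
                    return err // checkpointing finished (successfully or not) in time
                case <-ctx.Done():
                    return ctx.Err()
                }
            }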

             

             


            build-team Couchbase Build Team added a comment -

            Build couchbase-server-6.5.0-1351 contains goxdcr commit a1d21e8 with commit message:
            MB-31352 fix replication hanging problem

            People

              Assignee: pavithra.mahamani Pavithra Mahamani
              Reporter: deepkaran.salooja Deepkaran Salooja
              Votes: 0
              Watchers: 7
