Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-17297

CAPI replication fails repeatedly because of incomplete http response

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Major
    • 4.1.1, 4.5.0
    • 4.5.0
    • XDCR
    • Security Level: Public
    • None
    • Untriaged
    • Unknown

    Description

      When running CAPI against elastic search, I noticed that replication failed repeatedly. After XDCR sends http requests to target cluster to update documents, it tries to read http response. Sometimes it gets only the first half of the response. Next time it reads the response, the second half of the previous response is prepended to the second response, resulting in an invalid http response and causing XDCR to fail eventually.

      The buffer that XDCR allocates to hold the response is too small. After increasing it to the correct size, the frequency of the problem was greatly reduced. Strangely, it did not go away completely.

      I tried several different ways of getting http response and none worked. The incomplete response is not caused by buffer being too small or read time out being too small. Need some more investigation.

      The easiest way to work around this problem is to let CAPI reset tcp connection when an incomplete response is seen.

      Attachments

        Issue Links

          For Gerrit Dashboard: MB-17297
          # Subject Branch Project Status CR V

          Activity

            People

              arunkumar Arunkumar Senthilnathan (Inactive)
              yu Yu Sui (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Gerrit Reviews

                  There are no open Gerrit changes

                  PagerDuty