  1. Couchbase Server
  2. MB-57889

Race between active job failure and its recovery task leaving link disconnected

Details

    • Bug
    • Resolution: Duplicate
    • Critical
    • 7.2.0
    • 7.1.4
    • analytics
    • Untriaged
    • 0
    • Unknown
    • Analytics Sprint 28, Analytics Sprint 29, Analytics Sprint 30, Analytics Sprint 31, Analytics Sprint 32, Analytics Sprint 33

    Description

      When an active job fails while its recovery task is still running, the failing job assumes that recovery is still ongoing, while the recovery task assumes that the job did not fail. As a result, neither side retriggers recovery: the link stays disconnected and new data is not ingested.
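The interleaving described above can be illustrated with a small deterministic model. This is only a sketch for reasoning about the race, not the Analytics implementation: `LinkState`, `on_job_failure`, `recovery_finish`, and the `guarded` flag are all hypothetical names, and the "guarded" variant shows just one possible way to close the window, not necessarily the fix that was applied.

```python
from dataclasses import dataclass


@dataclass
class LinkState:
    """Hypothetical, simplified state shared by the job and its recovery task."""
    connected: bool = False           # link is disconnected while recovery runs
    job_active: bool = True
    recovery_running: bool = True
    recovery_requested: bool = False  # only used by the guarded variant


def start_recovery(state: LinkState) -> None:
    """Restart the job and reconnect the link (stand-in for real recovery work)."""
    state.job_active = True
    state.connected = True


def recovery_observes_job(state: LinkState) -> bool:
    """Step 1 of the recovery task's check-then-act: sample the job state."""
    return state.job_active


def on_job_failure(state: LinkState, guarded: bool) -> None:
    """Failure handler: defers to a recovery task it believes is still running."""
    state.job_active = False
    if state.recovery_running:
        if guarded:
            # Leave a note so the recovery task cannot miss this failure.
            state.recovery_requested = True
        return
    start_recovery(state)


def recovery_finish(state: LinkState, job_looked_active: bool, guarded: bool) -> None:
    """Step 2 of the recovery task: act on the (possibly stale) observation."""
    state.recovery_running = False
    if guarded and state.recovery_requested:
        state.recovery_requested = False
        start_recovery(state)
        return
    if job_looked_active:
        return  # stale belief: "the job did not fail", so nothing is done
    start_recovery(state)


def simulate(guarded: bool) -> bool:
    """Replay the problematic interleaving; return whether the link ends up connected."""
    state = LinkState()
    seen = recovery_observes_job(state)    # recovery sees a healthy job
    on_job_failure(state, guarded)         # job fails, assumes recovery is ongoing
    recovery_finish(state, seen, guarded)  # recovery ends, assumes no failure
    return state.connected


if __name__ == "__main__":
    print("unguarded link connected:", simulate(guarded=False))
    print("guarded link connected:  ", simulate(guarded=True))
```

In the unguarded run each side defers to the other and the link is left disconnected. In a real concurrent system the `recovery_requested` handshake would also need to be read and written under a common lock; the model omits that because it replays a single fixed interleaving.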

      The workaround is to manually disconnect and reconnect the link, or to call the Analytics Cluster Restart API to retrigger the link recovery task.
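A rough sketch of how either workaround might be scripted is shown below. It rests on several assumptions that should be checked against the Analytics REST API documentation for your version: that statements are submitted with a POST to `/analytics/service`, that the restart endpoint is `/analytics/cluster/restart`, that the Analytics service listens on port 8095, and that the affected link is the default `Local` link. The helper names, host, and credentials are placeholders. The functions only build the requests; nothing is sent until you pass a request to `urllib.request.urlopen`.

```python
import base64
import urllib.parse
import urllib.request


def _auth_header(user: str, password: str) -> dict:
    """Build an HTTP Basic auth header (placeholder credentials)."""
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    return {"Authorization": f"Basic {token}"}


def statement_request(base_url: str, statement: str, user: str, password: str):
    """Request that submits one statement to the (assumed) Analytics service endpoint."""
    data = urllib.parse.urlencode({"statement": statement}).encode()
    return urllib.request.Request(
        f"{base_url}/analytics/service",
        data=data,
        headers=_auth_header(user, password),
        method="POST",
    )


def reconnect_link_requests(base_url: str, link: str, user: str, password: str):
    """Workaround 1: disconnect the link, then connect it again."""
    return [
        statement_request(base_url, f"DISCONNECT LINK {link};", user, password),
        statement_request(base_url, f"CONNECT LINK {link};", user, password),
    ]


def cluster_restart_request(base_url: str, user: str, password: str):
    """Workaround 2: request for the (assumed) Analytics Cluster Restart endpoint."""
    return urllib.request.Request(
        f"{base_url}/analytics/cluster/restart",
        data=b"",
        headers=_auth_header(user, password),
        method="POST",
    )


if __name__ == "__main__":
    base = "http://localhost:8095"  # assumed Analytics host and port
    for req in reconnect_link_requests(base, "Local", "Administrator", "password"):
        print(req.get_method(), req.full_url, req.data.decode())
    # To actually send a request: urllib.request.urlopen(req)
```

Reconnecting the link is the narrower of the two options since it touches only the affected link, whereas a cluster restart affects the whole Analytics service.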

            People

              Assignee: Murtadha Hubail
              Reporter: Murtadha Hubail