Couchbase Server / MB-10370

ep-engine deadlock in write-heavy DGM cases

Details

    • Bug
    • Resolution: Fixed
    • Blocker
    • 3.0.3, 4.0.0
    • 2.2.0, 2.5.0, 3.0, 3.0.2
    • couchbase-bucket
    • Security Level: Public
    • Platform = Physical
      OS = CentOS 6.5
      CPU = Intel Xeon E5-2630
      Memory = 64 GB
      Disk = 2 x SSD
    • Yes
    • Mar 9 - Mar 27

    Description

      This is not a new issue; we have discussed it many times.

      In extremely write-heavy cases we overload the servers: memory usage reaches 95% of the bucket quota, we eject all replica items, and eventually the system becomes unusable.

      I'm creating this ticket because of XDCR. In 3.0 we can achieve very high XDCR throughput, and throttling it doesn't make sense. According to the PM team, some "users" deploy XDCR within the same data center, so this is quite a realistic scenario.

      Feel free to close this ticket as a duplicate of an existing bug, though I didn't manage to find anything well-defined.

      Added: this issue recurs for most DGM settings < 20%, even with moderate update ops of 3K/node. The only current SE/PM advice is to increase memory to avoid it, but then the value of DGM is lost for many expected conditions.

            People

              wayne Wayne Siu
              pavelpaulau Pavel Paulau (Inactive)
              Votes: 0
              Watchers: 10
