Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-59657

persist_to_majority on magma bucket fails with status code 0x00a3

    XMLWordPrintable

Details

    • Bug
    • Resolution: User Error
    • Major
    • 7.6.0
    • 7.6.0
    • qe
    • Operating System : Debian 12
      Couchbase Enterprise Edition : 7.6.0-1484

    Description

      Steps to repro

      1. Created a 2 node kv cluster
      2. Created a magma bucket
      3. Started writes with durability=PERSIST_TO_MAJORITY (Tried to load 16 items onto the bucket)

      Observed that the write calls return even when sdk_timeout is 1200 seconds

      Status: The SyncWrite request has not completed in the specified time and has ambiguous result (0x00a3)

      Further observing that the response is returned within 75 seconds instead of the wait for 1200 seconds

      On 7.6.0 the Server Recv -> Send duration is seen 11.29 while on 7.2.0 it is seen as 35049206.5981604

      7.6.0 Server Recv -> Send duration

      7.2.0 Server Recv -> Send duration

       

      Logs for 7.6.0-1786

      server logs
      https://cb-engineering.s3.amazonaws.com/MB-59657/collectinfo-2023-11-15T024255-ns_1@172.23.96.168.zip
      https://cb-engineering.s3.amazonaws.com/MB-59657/collectinfo-2023-11-15T024255-ns_1@172.23.121.71.zip

      server pcaps
      https://cb-engineering.s3.amazonaws.com/MB-59657/172.23.96.168_pcaps.zip
      https://cb-engineering.s3.amazonaws.com/MB-59657/172.23.121.71_pcaps.zip

      client pcaps
      https://cb-engineering.s3.amazonaws.com/MB-59657/client.pcap

      Java sdk logs
      https://cb-engineering.s3.amazonaws.com/MB-59657/JavaSDK.log

       

      A pcap for the same is uploaded for 7.2.0 and 7.6.0-1484 as well.

      7.2.0 --> https://cb-engineering.s3.amazonaws.com/MB-59657/7.2.0.pcap

      7.6.0 - 1484 ---> https://cb-engineering.s3.amazonaws.com/MB-59657/7.6.0-1484.pcap

       

      Note: Same test with bucket_storage=couchstore works as expected. (All docs are getting inserted without any timeouts)


       

       

      TAF test to reproduce

      guides/gradlew --refresh-dependencies testrunner -P jython=/opt/jython/bin/jython -P 'args=-i /data/workspace/debian-p0-durability-vset00-00-castest_6.5_P1_persist_majority/testexec.68448.ini -t castest.opschangecas.OpsChangeCasTests.ops_change_cas,mutate_times=20,durability=PERSIST_TO_MAJORITY,nodes_init=2,doc_ops=touch,num_items=16'

      Job link : http://cb-logs-qe.s3-website-us-west-2.amazonaws.com/7.6.0-1767/jenkins_logs/test_suite_executor-TAF/284857/

      Job : debian-durability_castest_6.5_P1_persist_majority

       

      Attachments

        1. 7.2.0_pcap.png
          7.2.0_pcap.png
          299 kB
        2. 7.2.0_pcap-1.png
          7.2.0_pcap-1.png
          299 kB
        3. 7.2.0.pcap
          9.94 MB
        4. 7.6.0-1484.pcap
          23.75 MB
        5. image-2023-11-15-16-38-19-455.png
          image-2023-11-15-16-38-19-455.png
          68 kB
        6. image-2023-11-15-16-41-01-914.png
          image-2023-11-15-16-41-01-914.png
          41 kB
        7. Screenshot 2023-11-15 at 14.10.59.png
          Screenshot 2023-11-15 at 14.10.59.png
          159 kB
        8. Screenshot 2023-11-15 at 14.13.43.png
          Screenshot 2023-11-15 at 14.13.43.png
          163 kB

        Activity

          People

            raghav.sk Raghav S K
            raghav.sk Raghav S K
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              PagerDuty