Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-60819

FTS service crashed during rebalance

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • 7.6.0
    • 7.6.0
    • fts
    • Capella
      Couchbase server Enterprise Edition 7.6.0 build 2142

    Description

      1. Created a capella cluster with the following configuration:

      3 KV (8vCPU 32GB RAM), 5 FTS (16vCPU 32GB RAM), 2 N1QL (4vCPU 16GB RAM)

      2. Created a bucket 'sift_bucket', scope 'sift_scope0', collection 'sift_collection0'.

      3. Loaded 5M vector data with dimension 1536.

      4. Created FTS index on 'vector_data' field.

      5. Wait for indexing to complete.

      6. Start running queries on the index.

      7. Start a rebalance in operation (Add a new FTS node).

       

      Observation:

      FTS service crashed with SIGSEGV: segmentation violation error on node-004.

      Service 'fts' exited with status 2. Restarting. Messages:
      /home/couchbase/.cbdepscache/exploded/x86_64/go-1.21.6/go/src/runtime/select.go:327 +0x725 fp=0xc01bafa758 sp=0xc01bafa638 pc=0x452aa5
      github.com/couchbase/cbauth/service.withTimeout.func1()
      /home/couchbase/jenkins/workspace/couchbase-server-unix/goproj/src/github.com/couchbase/cbauth/service/revrpc.go:42 +0x70 fp=0xc01bafa7e0 sp=0xc01bafa758 pc=0xe8e170
      runtime.goexit()
      /home/couchbase/.cbdepscache/exploded/x86_64/go-1.21.6/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc01bafa7e8 sp=0xc01bafa7e0 pc=0x475821
      created by github.com/couchbase/cbauth/service.withTimeout in goroutine 224914
      /home/couchbase/jenkins/workspace/couchbase-server-unix/goproj/src/github.com/couchbase/cbauth/service/revrpc.go:40 +0xa5
       
       
      rax 0x0
      rbx 0x7f48d4000000
      rcx 0x0
      rdx 0x9036000
      rdi 0x7f50fc0162e0
      rsi 0x7f52c3c37580
      rbp 0x89
      rsp 0x7f4a2cf82b30
      r8 0x9
      r9 0x0
      r10 0x0
      r11 0x0
      r12 0xffffffffffffffff
      r13 0x7f50fc0162e0
      r14 0x7f517c000020
      r15 0x99ccc0
      rip 0x7f52c3b3d193
      rflags 0x10206
      cs 0x33
      fs 0x0
      gs 0x0 

      SIGSEGV: segmentation violation
      PC=0x7f52c3b3d193 m=103 sigcode=1
      signal arrived during cgo execution
       
      goroutine 241 [syscall]:
      runtime.cgocall(0x1261020, 0xc034e847e0)
          /home/couchbase/.cbdepscache/exploded/x86_64/go-1.21.6/go/src/runtime/cgocall.go:157 +0x4b fp=0xc034e847b8 sp=0xc034e84780 pc=0x40b7eb
      github.com/couchbase/cbft._Cfunc_get_total_heap_bytes()
          _cgo_gotypes.go:113 +0x48 fp=0xc034e847e0 sp=0xc034e847b8 pc=0x11dd568
      github.com/couchbase/cbft.getMemoryUtilization(...)
          /home/couchbase/jenkins/workspace/couchbase-server-unix/cbft/ns_server.go:733
      github.com/couchbase/cbft.setCurMemoryUsedWith(0xc0000f1020?)
          /home/couchbase/jenkins/workspace/couchbase-server-unix/cbft/ns_server.go:1460 +0x32 fp=0xc034e85e88 sp=0xc034e847e0 pc=0x11e73f2
      github.com/couchbase/cbft.RunRecentInfoCache(0xc000580800)
          /home/couchbase/jenkins/workspace/couchbase-server-unix/cbft/ns_server.go:1544 +0x35a fp=0xc034e85fc8 sp=0xc034e85e88 pc=0x11e779a
      github.com/couchbase/cbft.initNsServerCaching.func1.2()
          /home/couchbase/jenkins/workspace/couchbase-server-unix/cbft/ns_server.go:1311 +0x25 fp=0xc034e85fe0 sp=0xc034e85fc8 pc=0x11e64a5
      runtime.goexit()
          /home/couchbase/.cbdepscache/exploded/x86_64/go-1.21.6/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc034e85fe8 sp=0xc034e85fe0 pc=0x475821
      created by github.com/couchbase/cbft.initNsServerCaching.func1 in goroutine 177
          /home/couchbase/jenkins/workspace/couchbase-server-unix/cbft/ns_server.go:1311 +0xa5

      Rebalance failed

      Rebalance exited with reason {service_rebalance_failed,fts, {agent_died,<35179.4033.0>, {lost_connection, {'ns_1@svc-s-node-004.pqpwqbdwg6snxrdr.sandbox.nonprod-project-avengers.com', shutdown}}}}. Rebalance Operation Id = 759d0f1714a7df5537e79cf69e36a069
       

      Attachments

        No reviews matched the request. Check your Options in the drop-down menu of this sections header.

        Activity

          People

            mohsin.ahmed Mohsin Ahmed
            mohsin.ahmed Mohsin Ahmed
            Votes:
            0 Vote for this issue
            Watchers:
            9 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Gerrit Reviews

                There are no open Gerrit changes

                PagerDuty