Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-62310

>95% queries erroring out with enough RAM and CPU availability

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Critical
    • 7.6.4
    • 7.6.2
    • fts
    • Enterprise Edition 7.6.2 build 3710

    Description

      Steps:

      1. Create 5 node fts cluster
      2. Load 60M vector data with 1536 dims
      3. Assign 16GB ram to each fts node
      4. Create 1 index with 90 partition (18 part/node) 

      Run 10QPS for 1 Min, following was observed

      1. when indexing reached 10M. 100% queries failed
      2. when indexing reached 60M. 100% queries failed
      3. Manually ran some queries via UI, 4-5 passed
      4. Repeated step 1, saw more than 95% queries failing.

      Queries are failing not just with 429 BUT ALSO:

      172.23.105.122: fts
      2024-06-12T21:41:36.468-07:00 [ERRO] rest: error code: 400, msg: rest_index: Query, indexName: bucket1.scope_0.index1, err: pindex_consistency: ConsistencyWaitGroup cancelled -- rest.ShowErrorBody() at rest.go:59
      2024-06-12T21:41:36.469-07:00 [ERRO] rest: error code: 400, msg: rest_index: Query, indexName: bucket1.scope_0.index1, err: pindex_consistency: ConsistencyWaitGroup cancelled -- rest.ShowErrorBody() at rest.go:59
      2024-06-12T21:41:36.470-07:00 [ERRO] rest: error code: 400, msg: rest_index: Query, indexName: bucket1.scope_0.index1, err: pindex_consistency: ConsistencyWaitGroup cancelled -- rest.ShowErrorBody() at rest.go:59
      2024-06-12T21:41:36.471-07:00 [ERRO] rest: error code: 400, msg: rest_index: Query, indexName: bucket1.scope_0.index1, err: pindex_consistency: ConsistencyWaitGroup cancelled -- rest.ShowErrorBody() at rest.go:59

       

      2024-06-12T21:42:55.995-07:00 [ERRO] rest: error code: 400, msg: rest_index: Query, could not read request body, indexName: bucket1.scope_0.index1 -- rest.ShowErrorBody() at rest.go:59...[truncated]

      Memory usage only spiked to like 3GB other 13GB were still available.

      Attachments

        1. image-2024-06-13-13-57-09-690.png
          image-2024-06-13-13-57-09-690.png
          208 kB
        2. image-2024-06-13-13-57-31-689.png
          image-2024-06-13-13-57-31-689.png
          229 kB
        3. image-2024-06-13-14-07-15-906.png
          image-2024-06-13-14-07-15-906.png
          126 kB
        4. image-2024-06-17-18-46-47-118.png
          image-2024-06-17-18-46-47-118.png
          1.04 MB
        5. image-2024-06-17-18-50-19-430.png
          image-2024-06-17-18-50-19-430.png
          231 kB
        6. image-2024-06-17-18-51-17-893.png
          image-2024-06-17-18-51-17-893.png
          189 kB
        7. image-2024-06-17-18-54-49-301.png
          image-2024-06-17-18-54-49-301.png
          428 kB
        8. image-2024-06-17-18-55-06-986.png
          image-2024-06-17-18-55-06-986.png
          430 kB
        9. image-2024-06-17-18-55-33-803.png
          image-2024-06-17-18-55-33-803.png
          308 kB
        10. image-2024-06-27-14-48-19-874.png
          image-2024-06-27-14-48-19-874.png
          86 kB
        11. image-2024-06-27-14-58-29-536.png
          image-2024-06-27-14-58-29-536.png
          133 kB
        12. image-2024-06-27-15-02-03-370.png
          image-2024-06-27-15-02-03-370.png
          289 kB
        13. image-2024-06-27-15-13-04-860.png
          image-2024-06-27-15-13-04-860.png
          1.45 MB
        14. image-2024-06-27-15-16-53-791.png
          image-2024-06-27-15-16-53-791.png
          126 kB
        15. node1.pprof
          74 kB
        16. node2.pprof
          62 kB
        17. Screenshot 2024-06-13 at 11.44.36 AM.png
          Screenshot 2024-06-13 at 11.44.36 AM.png
          34 kB
        18. Screenshot 2024-06-14 at 11.32.34 AM.png
          Screenshot 2024-06-14 at 11.32.34 AM.png
          207 kB
        19. Screenshot 2024-06-14 at 11.59.06 AM.png
          Screenshot 2024-06-14 at 11.59.06 AM.png
          100 kB
        20. Screenshot 2024-06-14 at 12.01.26 PM.png
          Screenshot 2024-06-14 at 12.01.26 PM.png
          88 kB
        21. Screenshot 2024-06-17 at 12.44.05 PM.png
          Screenshot 2024-06-17 at 12.44.05 PM.png
          66 kB
        22. Screenshot 2024-06-27 at 8.25.33 AM.png
          Screenshot 2024-06-27 at 8.25.33 AM.png
          72 kB
        23. Screenshot 2024-06-27 at 8.36.53 AM.png
          Screenshot 2024-06-27 at 8.36.53 AM.png
          77 kB
        24. Screenshot 2024-06-28 at 11.55.43 AM.png
          Screenshot 2024-06-28 at 11.55.43 AM.png
          621 kB
        25. Screenshot 2024-07-12 at 11.58.08 AM.png
          Screenshot 2024-07-12 at 11.58.08 AM.png
          157 kB
        26. Screenshot 2024-07-12 at 11.58.25 AM.png
          Screenshot 2024-07-12 at 11.58.25 AM.png
          90 kB
        27. toomuchcpu.zip
          117.79 MB
        28. toomuchcpu2.zip
          100.07 MB

        Issue Links

          No reviews matched the request. Check your Options in the drop-down menu of this sections header.

          Activity

            People

              sarthak.dua Sarthak Dua
              sarthak.dua Sarthak Dua
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:

                Gerrit Reviews

                  There are no open Gerrit changes

                  PagerDuty