Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-53321

Babysitter: Detect and recycle indexing service when it's stuck

    XMLWordPrintable

Details

    • 1

    Description

      Support have encountered numerous issues (see Links for examples) where the correct course of action to unblock the customer is to restart the indexing service.
      Raising this ticket to consider if it would be appropriate for the babysitter to be able to detect and resolve such situations. e.g. if we repeatedly fail to get stats in the following way, instead of (or prior to) failing over the entire node, could we kick the service and restart it?

      Failed to grab service index status
      : {exit,
                                            {timeout,
                                             {gen_server,call,
                                              ['service_agent-index',get_status,
                                               2000]}},
                                            [{gen_server,call,3,
                                              [{file,"gen_server.erl"},{line,214}]},
                                             {ns_heart,grab_one_service_status,1,
                                              [{file,"src/ns_heart.erl"},
                                               {line,494}]},
                                             {ns_heart,
                                              '-grab_service_statuses/0-lc$^1/1-1-',
                                              1,
                                              [{file,"src/ns_heart.erl"},
                                               {line,490}]},
      

      Attachments

        Issue Links

          No reviews matched the request. Check your Options in the drop-down menu of this sections header.

          Activity

            People

              Abhijeeth.Nuthan Abhijeeth Nuthan
              dhaikney David Haikney (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:

                Gerrit Reviews

                  There are no open Gerrit changes

                  PagerDuty