Uploaded image for project: 'Couchbase Server'
  1. Couchbase Server
  2. MB-23081

Crashes during automated cluster init + join

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Major
    • bug-backlog
    • 4.5.0
    • installer
    • None
    • Triaged
    • Unknown

    Description

      I'm running Couchbase Server 4.5.0 community on an EC2 instance running CentOS7.3 and seeing sporadic startup failures.

      Steps to repro

      It should be very easy to repro with these steps. In my last 2 attempts, I repro'd one out of two times.

      Observed Behavior

      After the ec2 instance started up, I ssh'd into the machine and ran:

      systemctl status couchbase-server

      and it shows:

      systemctl status couchbase-server
      ● couchbase-server.service - Couchbase Server
      Loaded: loaded (/usr/lib/systemd/system/couchbase-server.service; enabled; vendor preset: disabled)
      Active: failed (Result: timeout) since Wed 2017-03-01 05:04:33 UTC; 3min 53s ago
      Docs: http://docs.couchbase.com

      Mar 01 05:02:56 ip-172-31-12-251 systemd[1]: Starting Couchbase Server...
      Mar 01 05:02:59 ip-172-31-12-251 systemd[1]: PID file /opt/couchbase/var/lib/couchbase/couchbase-server.pid not readable (yet?) after start.
      Mar 01 05:04:33 ip-172-31-12-251 systemd[1]: couchbase-server.service start operation timed out. Terminating.
      Mar 01 05:04:33 ip-172-31-12-251 systemd[1]: Failed to start Couchbase Server.
      Mar 01 05:04:33 ip-172-31-12-251 systemd[1]: Unit couchbase-server.service entered failed state.
      Mar 01 05:04:33 ip-172-31-12-251 systemd[1]: couchbase-server.service failed.

      I was able to restart it successfully:

      systemctl status couchbase-server
      ● couchbase-server.service - Couchbase Server
      Loaded: loaded (/usr/lib/systemd/system/couchbase-server.service; enabled; vendor preset: disabled)
      Active: active (running) since Wed 2017-03-01 05:10:08 UTC; 10s ago
      Docs: http://docs.couchbase.com
      Process: 2720 ExecStart=/opt/couchbase/bin/couchbase-server – -noinput -detached (code=exited, status=0/SUCCESS)
      Main PID: 2800 (beam.smp)
      CGroup: /system.slice/couchbase-server.service
      ├─2732 /opt/couchbase/lib/erlang/erts-5.10.4.0.0.1/bin/epmd -daemon
      ├─2769 /opt/couchbase/lib/erlang/erts-5.10.4.0.0.1/bin/beam.smp -A 16 – -root /opt/couchbase/lib/erlang -progname erl – -home /opt/couchbase – -smp enable -kernel ine...
      ├─2800 /opt/couchbase/lib/erlang/erts-5.10.4.0.0.1/bin/beam.smp -A 16 -sbt u -P 327680 -K true -swt low -MMmcs 30 -e102400 – -root /opt/couchbase/lib/erlang -progname e...
      ├─2834 sh -s disksup
      ├─2836 /opt/couchbase/lib/erlang/lib/os_mon-2.2.14/priv/bin/memsup
      ├─2837 /opt/couchbase/lib/erlang/lib/os_mon-2.2.14/priv/bin/cpu_sup
      ├─2838 inet_gethost 4
      ├─2839 inet_gethost 4
      ├─2840 /opt/couchbase/lib/erlang/erts-5.10.4.0.0.1/bin/beam.smp -P 327680 -K true – -root /opt/couchbase/lib/erlang -progname erl – -home /opt/couchbase – -pa /opt/co...
      ├─2871 sh -s disksup
      ├─2872 /opt/couchbase/lib/erlang/lib/os_mon-2.2.14/priv/bin/memsup
      ├─2874 /opt/couchbase/lib/erlang/lib/os_mon-2.2.14/priv/bin/cpu_sup
      ├─2880 /opt/couchbase/bin/priv/godu
      ├─2885 sh -s ns_disksup
      ├─2886 /opt/couchbase/bin/priv/godu
      ├─2892 /opt/couchbase/bin/saslauthd-port
      ├─2897 portsigar for ns_1@127.0.0.1
      ├─2898 /opt/couchbase/bin/goport
      ├─2901 /opt/couchbase/bin/memcached -C /opt/couchbase/var/lib/couchbase/config/memcached.json
      └─2904 /opt/couchbase/bin/goxdcr -sourceKVAdminPort=8091 -xdcrRestPort=9998 -isEnterprise=false

      Mar 01 05:10:04 ip-172-31-12-251 systemd[1]: Starting Couchbase Server...
      Mar 01 05:10:05 ip-172-31-12-251 systemd[1]: PID file /opt/couchbase/var/lib/couchbase/couchbase-server.pid not readable (yet?) after start.
      Mar 01 05:10:08 ip-172-31-12-251 systemd[1]: couchbase-server.service: Supervising process 2800 which is not our child. We'll most likely not notice when it exits.
      Mar 01 05:10:08 ip-172-31-12-251 systemd[1]: Started Couchbase Server.

      Attachments

        No reviews matched the request. Check your Options in the drop-down menu of this sections header.

        Activity

          People

            traun Traun Leyden (Inactive)
            traun Traun Leyden (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Gerrit Reviews

                There are no open Gerrit changes

                PagerDuty