Couchbase Kubernetes / K8S-605

Reconcile fails when 'ReadyMembers' are actually down


Details

    • Type: Bug
    • Resolution: Fixed
    • Priority: Major
    • Fix Version: 1.1.0
    • Affects Version: None
    • Component: operator
    • Labels: None

    Description

      The following steps lead to a scenario where nodes are marked as Ready but are actually down, causing the reconcile loop to fail continuously:

      • Start a 3-node cluster
      • Delete nodes 0001 and 0002

      This leaves only node 0000 as ready:

        Members:
          Index:  3
          Ready:
            Name:  cb-example-0000
          Unready:
            Name:  cb-example-0001
            Name:  cb-example-0002
       

       

      • Wait for node 0001 to start.
      • Delete node 0000 while 0001 is starting.

      At this point 0001 is the node that is actually up, but the status still lists only 0000 as Ready.

      Reconcile then fails, because the operator uses the Ready members as API clients. The fix is to resync the Ready members after a failed reconcile.
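      The failure mode and fix can be sketched as follows. This is a minimal, self-contained simulation, not the operator's actual code: the `Cluster` type, its `ready`/`live` fields, and the `resyncReadyMembers` name are all hypothetical stand-ins for the operator's cached member status and pod state.

```go
package main

import "fmt"

// Cluster is a hypothetical stand-in for the operator's view of a
// Couchbase cluster: a cached Ready list (which can go stale) and the
// actual liveness of each node.
type Cluster struct {
	ready []string        // cached Ready members, used as API clients
	live  map[string]bool // actual node state (stand-in for pod status)
}

// reconcile fails if any cached Ready member is actually down, since
// the API client pointed at it is unreachable.
func (c *Cluster) reconcile() error {
	for _, m := range c.ready {
		if !c.live[m] {
			return fmt.Errorf("API client %s unreachable", m)
		}
	}
	return nil
}

// resyncReadyMembers rebuilds the Ready list from live state -- the fix:
// run this after a failed reconcile so stale members are dropped.
func (c *Cluster) resyncReadyMembers() {
	var ready []string
	for m, up := range c.live {
		if up {
			ready = append(ready, m)
		}
	}
	c.ready = ready
}

func main() {
	c := &Cluster{
		// Stale status: 0000 was deleted while 0001 was starting.
		ready: []string{"cb-example-0000"},
		live: map[string]bool{
			"cb-example-0000": false,
			"cb-example-0001": true,
		},
	}
	if err := c.reconcile(); err != nil {
		fmt.Println("reconcile failed:", err)
		c.resyncReadyMembers()
	}
	if err := c.reconcile(); err == nil {
		fmt.Println("reconcile succeeded with members:", c.ready)
	}
}
```

      Without the resync, every retry would reuse the stale Ready list and fail the same way; resyncing after the failure lets the next reconcile pick up 0001 as the usable API client.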



      People

        Assignee: tommie Tommie McAfee (Inactive)
        Reporter: tommie Tommie McAfee (Inactive)
        Votes: 0
        Watchers: 1


              Gerrit Reviews

                There are no open Gerrit changes
