Details

Type: Improvement
Resolution: Fixed
Priority: Major
Fix Version/s: 1.0.0
Affects Version/s: None
Component/s: operator
Labels:
- kubernetes

Epic Link: Persistent Volumes

Description

Operator detects that one or more nodes are down
Watch the logs to detect that a down node cannot be auto-failed over
(any event in the autofailover module != EVENT_NODE_AUTO_FAILOVERED)
Check that the persistent volumes of the failed Pods have status = Ready
Quit if any Pod volumes are inaccessible
Delete the failed Pod if it still exists in Kubernetes (the Pod's volumes are not deleted)
Recreate the Pod with exactly the same name and spec as the failed Pod
Wait for the new Pod to become active within the cluster
Repeat for all down nodes until all Pods are recovered
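
The steps above can be sketched as a small simulation. This is only an illustration under stated assumptions, not the operator's actual code: `FakeCluster`, `Pod`, `auto_failover_failed`, and `recover_down_nodes` are hypothetical names, the cluster is an in-memory stand-in rather than a real Kubernetes client, and the only identifiers taken from the description are `EVENT_NODE_AUTO_FAILOVERED` and the volume status `Ready`.

```python
from dataclasses import dataclass

# Event name taken from the description; every other name below is hypothetical.
EVENT_NODE_AUTO_FAILOVERED = "EVENT_NODE_AUTO_FAILOVERED"


@dataclass
class Pod:
    name: str
    spec: dict
    volumes: list  # names of the persistent volumes the Pod mounts


class FakeCluster:
    """Hypothetical in-memory stand-in for Kubernetes and the database cluster."""

    def __init__(self, pods, volume_status):
        self.pods = {p.name: p for p in pods}
        self.volume_status = dict(volume_status)  # volume name -> status string
        self.active = set()

    def delete_pod(self, name):
        # Deleting a Pod leaves its persistent volumes in place.
        self.pods.pop(name, None)
        self.active.discard(name)

    def create_pod(self, pod):
        self.pods[pod.name] = pod

    def wait_until_active(self, name):
        # A real operator would poll with a timeout; the fake succeeds at once.
        if name not in self.pods:
            return False
        self.active.add(name)
        return True


def auto_failover_failed(events):
    """True if any autofailover event is not EVENT_NODE_AUTO_FAILOVERED."""
    return any(event != EVENT_NODE_AUTO_FAILOVERED for event in events)


def recover_down_nodes(cluster, down_pods, events):
    """Recreate each down Pod on its existing volumes; return recovered names."""
    if not auto_failover_failed(events):
        return []  # auto-failover handled the failure, nothing to recover

    # Quit before touching any Pod if a volume is inaccessible.
    for pod in down_pods:
        for volume in pod.volumes:
            if cluster.volume_status.get(volume) != "Ready":
                raise RuntimeError(f"volume {volume} of {pod.name} is not Ready")

    recovered = []
    for pod in down_pods:
        cluster.delete_pod(pod.name)  # no-op if the Pod is already gone
        # Same name, same spec, same volumes as the failed Pod.
        cluster.create_pod(Pod(pod.name, dict(pod.spec), list(pod.volumes)))
        if not cluster.wait_until_active(pod.name):
            raise RuntimeError(f"{pod.name} did not become active")
        recovered.append(pod.name)
    return recovered
```

In this sketch the volume check runs for every down Pod before any Pod is deleted, so an inaccessible volume stops the whole recovery without leaving a Pod half-recreated. Whether the real operator checks up front or per Pod is not stated in the description.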


People

Assignee: Tommie McAfee (Inactive)

Reporter: Tommie McAfee (Inactive)

Votes: 0

Watchers: 1

Dates

Created: 05/Apr/18 4:48 PM

Updated: 21/Aug/20 10:30 AM

Resolved: 29/May/18 8:55 AM

Gerrit Reviews

There are no open Gerrit changes.

There are 6 closed Gerrit changes:

K8S-259: node recovery after unsuccessful auto-failover: Gerrit Review:

K8S-259: Reuse volume mounts of persisted pods: Gerrit Review:

K8S-259: Delta recovery of failed over nodes: Gerrit Review:

K8S-259: node recovery after unsuccessful auto-failover: Gerrit Review:

K8S-259: node recovery after unsuccessful auto-failover: Gerrit Review:

K8S-259: Don't reconcile the cluster if nodes are warming up: Gerrit Review:
