Loading...

XML

Word

Printable

Details

Type: Bug
Resolution: Won't Fix
Priority: Major
Fix Version/s: None
Affects Version/s: 6.5.1, 6.6.0, 6.6.1, 6.5.0, Cheshire-Cat
Component/s: tools
Labels:
None

Triage:
Untriaged
Story Points:
1
Is this a Regression?:
No

Description

What's the issue?
We've seen a couple of cases where restores are "hanging" indefinitely with buckets with a very low residency ratio. I suspect this may be due to the fact that 'TEMP_OOM' errors are retried indefinitely (i.e. we'll never exhaust retries and return an error).

Is there a workaround?
Yes, restore to an adequately provisioned cluster.

What's the fix
We should account for 'TEMP_OOM' failures during a restore, note that they should be handled somewhat specially i.e. we should retry more times for 'TEMP_OOM' failures because in most scenarios data will be evicted allowing further progress.

Attachments

Issue Links

relates to

MB-38686 [CBM] Add support for resuming restores

Closed

Gerrit Reviews

- Issue Only
- Show All Reviews
- Show Open Reviews
- Show All Issues
- Show Open Issues

No reviews matched the request. Check your Options in the drop-down menu of this sections header.

Activity

People

Assignee:: James Lee

Reporter:: James Lee

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 11/Dec/20 10:37 AM

Updated:: 22/Apr/21 3:34 AM

Resolved:: 22/Apr/21 3:34 AM

Gerrit Reviews

There are no open Gerrit changes

cbbackupmgr TEMP_OOM failures should be accounted for in retries

Details

Description

Attachments

Issue Links

Gerrit Reviews

Activity

People

Dates

Gerrit Reviews

PagerDuty