[BP to 7.1.5] Auth Error floods the logs for upgrade from 7.0.4 to 7.2.0

Description

Short description:

Indexer had an incorrect logic to identify active indexer nodes during multi-service rebalance (at least kv and index service part of rebalance). This could cause a potential downtime on new clients created leading to failures in index creation, builds, or other DDL operations.

Problem Description:

When a rebalance is performed, new nodes which are to be added to the cluster show up in nodes endpoint with status as `active`. If an active node has `index` service, we consider it for port translation in our iterations. However, the service ports may not show up in nodeServices endpoint until the service rebalance is started which will happen after KV rebalance. Hence any calls to translate port during the KV rebalance could lead to port translation errors for nodes which come after the new nodes in the nodes endpoint. This means until KV rebalance, index creations, builds, alters could have a potential downtime on new clients.

Problem resolution:

Update the information used by TranslatePort to use the nodeServices endpoint

 

Issue

Resolution

Indexer contained incorrect logic to identify active indexer nodes during a multi-service rebalance. This caused potential downtime and failures in index creation, builds, or other DDL operations.

The information used by TranslatePort has been updated to use the node Services endpoint to correct this issue.

Components

Affects versions

Fix versions

Labels

Environment

Enterprise Edition 7.0.4 build 7279 couchbase-server-enterprise-7.2.0-5235

Link to Log File, atop/blg, CBCollectInfo, Core dump

None

Release Notes Description

None

Activity

Show:

Hemant Rajput June 12, 2023 at 10:01 AM

Validated on 7.1.5-3843

CB robot June 9, 2023 at 8:18 PM

Build couchbase-server-7.1.5-3842 contains indexing commit 1fc880a with commit message:
https://couchbasecloud.atlassian.net/browse/MB-57295#icft=MB-57295 - fix auth errors in watcher

Fixed
Pinned fields
Click on the next to a field label to start pinning.

Details

Assignee

Reporter

Is this a Regression?

Unknown

Triage

Untriaged

Story Points

Priority

Instabug

Open Instabug

PagerDuty

Sentry

Zendesk Support

Created June 7, 2023 at 9:25 AM
Updated September 19, 2023 at 11:16 AM
Resolved June 9, 2023 at 6:57 PM
Instabug