Details

Type: Bug
Resolution: Fixed
Priority: Minor
Fix Version/s: 7.1.0
Affects Version/s: 7.1.0
Component/s: couchbase-bucket
Labels: None
Triage: Untriaged
Story Points: 1
Is this a Regression?: Unknown
Sprint: KV 2021-Dec

Description

What is the issue?
Currently, couch_dbdump doesn't handle escape sequences that include '\x' correctly; instead it outputs "\\u00ffffff" (unidentified character), which breaks many UTF-8 symbols. I am almost certain that '\\u00ffffff' is always output in place of '\x' since, at least for the travel-sample bucket, replacing all its occurrences with '\x' makes the contents of all documents in the couch_dbdump output match the contents of all corresponding documents in the cbriftdump output for a full backup of the same data.
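
The substitution check described above can be sketched roughly as follows. This is a minimal illustration, not the script used for this report: the function name `restore_hex_escapes` is hypothetical, and it assumes the marker appears in the dump text literally as two backslashes followed by `u00ffffff`, as in the example quoted in this report.

```python
# Assumption: the marker is the literal text \\u00ffffff (two backslashes),
# matching how it is quoted in this report.
MARKER = r"\\u00ffffff"
REPLACEMENT = r"\x"


def restore_hex_escapes(dump_text: str) -> str:
    """Replace every occurrence of the marker with a literal backslash-x."""
    return dump_text.replace(MARKER, REPLACEMENT)


broken = r"\\u00ffffffe2\\u00ffffff80\\u00ffffff93"
print(restore_hex_escapes(broken))  # \xe2\x80\x93
```

The restored text could then be compared document by document against the cbriftdump output, which is the comparison the report relies on.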

Example:
Instead of outputting the

\xe2\x80\x93

escape sequence, which stands for the "en dash" symbol, couch_dbdump outputs

\\u00ffffffe2\\u00ffffff80\\u00ffffff93
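
For reference, the three bytes in the expected sequence are the UTF-8 encoding of U+2013 (en dash). One way to confirm this is sketched below in Python; the helper name `decode_hex_escapes` is hypothetical, and it only handles literal `\xNN` escapes.

```python
import re


def decode_hex_escapes(text: str) -> str:
    """Decode literal backslash-x hex escapes into the UTF-8 text they encode."""
    hex_pairs = re.findall(r"\\x([0-9a-fA-F]{2})", text)
    raw = bytes(int(pair, 16) for pair in hex_pairs)
    return raw.decode("utf-8")


symbol = decode_hex_escapes(r"\xe2\x80\x93")
print(symbol == "\u2013")       # True
print(f"U+{ord(symbol):04X}")   # U+2013
```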

Steps to reproduce:

1. Set up and configure a cluster with one data node.
2. Import the travel-sample sample bucket using the web UI.
3. Run something like

couch_dbdump --json ~/cb/source/ns_server/data/n_0/data/travel-sample/*.couch.1 | grep "\\u00ffffff"

to get all of the documents that contain '\\u00ffffff'.
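
Because backslashes are interpreted by both the shell and grep, a literal substring match can be easier to reason about than a quoted pattern. The sketch below shows one such filter in Python; `lines_with_marker` is a hypothetical helper, the marker is assumed to be the literal text quoted in this report, and the second sample line is made up for illustration.

```python
from typing import Iterable, List

# Assumption: the marker is the literal text \\u00ffffff (two backslashes).
MARKER = r"\\u00ffffff"


def lines_with_marker(lines: Iterable[str], marker: str = MARKER) -> List[str]:
    """Return the lines that contain the marker as a plain substring."""
    return [line for line in lines if marker in line]


sample = [
    r"\\u00ffffffe2\\u00ffffff80\\u00ffffff93",  # sequence quoted in this report
    "plain ascii line",                           # made-up line without the marker
]
print(len(lines_with_marker(sample)))  # 1
```

To filter real output, the same function could be given `sys.stdin` with the `couch_dbdump --json` output piped into the script.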

People

Assignee: Maksimiljans Januska

Reporter: Maksimiljans Januska

Votes: 0

Watchers: 3

Dates

Created: 29/Nov/21 9:46 AM

Updated: 10/Jan/22 2:20 AM

Resolved: 21/Dec/21 7:07 AM

Gerrit Reviews

There are no open Gerrit changes

There are 2 closed Gerrit changes

MB-49819: Dump JSON as ASCII & escape any UNICODE chars: Gerrit Review:

MB-50233 Remove temporary fix for MB-49819: Gerrit Review:

couch_dbdump incorrectly outputs escape sequences that include '\x'
