  1. Couchbase Server
  2. MB-18536

Certain buckets can cause very large schema inferencing results

Details

    • Bug
    • Resolution: Fixed
    • Major
    • 4.5.0
    • 4.5.0
    • tools
    • None
    • Untriaged
    • Unknown

    Description

      The "tracks3" data set, from Couchbase training, has a schema that wreaks havoc with the schema inferencer. Each track has a subdocument called "reviews", and that subdocument has a separate field for each review, where the field name is the ID of the reviewer. As a result, the number of distinct field names is huge, and the inferred schema description runs to hundreds of thousands of lines. The inferencer needs to handle cases like this more intelligently, perhaps by taking a parameter for the maximum number of fields.
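
      As an illustration of the "maximum number of fields" idea, here is a minimal Python sketch. It is not the Couchbase inferencer's implementation. The function name infer_schema, the max_fields parameter, and the "*" wildcard entry are all hypothetical, chosen only for this example. When one level of a document has more distinct field names than max_fields, the sketch assumes the names are data (such as reviewer IDs) and merges their values under a single wildcard field.

```python
from collections import defaultdict


def infer_schema(docs, max_fields=100):
    """Infer a nested schema from an iterable of dict documents.

    Hypothetical sketch: any level with more than `max_fields` distinct
    field names is collapsed into a single wildcard entry named "*".
    """
    # Group every value seen at this level by its field name.
    children = defaultdict(list)
    for doc in docs:
        for key, value in doc.items():
            children[key].append(value)

    if len(children) > max_fields:
        # Too many distinct names: treat the names as data, not schema,
        # and merge all of their values under one wildcard field.
        merged = [v for values in children.values() for v in values]
        children = {"*": merged}

    schema = {}
    for key, values in children.items():
        subdocs = [v for v in values if isinstance(v, dict)]
        types = {type(v).__name__ for v in values if not isinstance(v, dict)}
        if subdocs:
            types.add("object")
        entry = {"types": sorted(types)}
        if subdocs:
            # Recurse so nested subdocuments get the same treatment.
            entry["fields"] = infer_schema(subdocs, max_fields)
        schema[key] = entry
    return schema


# Made-up documents shaped like the description: each track has a
# "reviews" subdocument keyed by reviewer ID.
tracks = [
    {
        "title": f"track{i}",
        "reviews": {f"user{i}_{j}": {"rating": j} for j in range(10)},
    }
    for i in range(20)
]
schema = infer_schema(tracks, max_fields=50)
print(schema["reviews"]["fields"].keys())
```

      With the made-up data above, the 200 distinct reviewer IDs under "reviews" exceed the cap of 50, so they are reported as one wildcard field whose subschema still describes "rating". A threshold like this trades precision for a bounded result size; a real implementation might also report how many names were collapsed.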

          People

            Assignee: Eben Haber (eben)
            Reporter: Eben Haber (eben)