  1. Couchbase Server
  2. MB-31894

[CX] Improve the sampling phase of the parallel sort

Details

    Description

      Currently, during the sampling phase of the parallel sort, the whole dataset is scanned, sampled, and materialized. Then, after the splitting vector is computed, the materialized dataset is read back so that the tuples can be redistributed according to the splitting vector and the rest of the sort can proceed.

      An improvement is to avoid scanning and materializing the whole dataset during the sampling phase, and instead read and materialize only a portion of the dataset that is large enough to compute the splitting vector.
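
      As a rough illustration of the idea only (not the actual implementation), the sketch below samples keys from a bounded prefix of the input and derives a splitting vector from that sample. All names (`sample_prefix`, `splitting_vector`, `partition_of`) and the parameters `max_tuples` and `sample_size` are hypothetical, and choosing a prefix as the "portion" is an assumption: a prefix is only representative when the input is not already ordered or clustered on the sort key.

```python
import bisect
import itertools
import random


def sample_prefix(tuples, max_tuples, sample_size, seed=0):
    """Reservoir-sample up to `sample_size` keys from the first `max_tuples` tuples.

    Only the prefix is consumed; the rest of the iterator is left untouched,
    so nothing beyond the prefix has to be scanned or materialized here.
    """
    rng = random.Random(seed)
    reservoir = []
    for i, key in enumerate(itertools.islice(tuples, max_tuples)):
        if i < sample_size:
            reservoir.append(key)
        else:
            # Replace an existing entry with probability sample_size / (i + 1).
            j = rng.randint(0, i)
            if j < sample_size:
                reservoir[j] = key
    return reservoir


def splitting_vector(sample, num_partitions):
    """Pick num_partitions - 1 evenly spaced split points from the sorted sample."""
    ordered = sorted(sample)
    if not ordered or num_partitions < 2:
        return []
    return [ordered[(k * len(ordered)) // num_partitions]
            for k in range(1, num_partitions)]


def partition_of(key, splitters):
    """Return the index of the partition that `key` is routed to."""
    return bisect.bisect_right(splitters, key)


if __name__ == "__main__":
    data = iter(random.Random(1).sample(range(10_000), 10_000))
    sample = sample_prefix(data, max_tuples=1_000, sample_size=200)
    splitters = splitting_vector(sample, num_partitions=4)
    print(splitters, partition_of(5_000, splitters))
```

      With this shape, the tuples that remain in the input after the prefix can be redistributed directly with `partition_of` as they are first read, instead of being scanned once for sampling and read again afterwards.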

          People

            ali.alsuliman Ali Alsuliman