Details

Type: Improvement
Resolution: Unresolved
Priority: Minor
Fix Version/s: Goldfish Public Preview
Affects Version/s: Goldfish Private Preview
Component/s: analytics
Labels: triaged
Epic Link: CBAS: Copy To Statement
Story Points: 0

Description

Because COPY TO always writes to shared storage (e.g., S3), redistributing the data is unnecessary when no ORDER BY is specified. Two or more compute partitions can write to the same destination partition without conflicts, because each file name is prefixed with the ID of the compute partition that wrote it.
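To illustrate why the compute partition ID prefix rules out conflicts, here is a minimal sketch in Python. The key layout, the `object_key` helper, and the in-memory `store` standing in for S3 are all assumptions made for illustration; they are not the actual CBAS file naming or writer code.

```python
from collections import defaultdict


def object_key(dest_partition: str, compute_partition_id: int, file_seq: int) -> str:
    """Build a hypothetical object key; the layout is illustrative only."""
    return f"{dest_partition}/{compute_partition_id:05d}-{file_seq:05d}.json"


def write_without_repartitioning(local_tuples):
    """Each compute partition writes its own tuples directly.

    local_tuples maps a compute partition ID to a list of
    (destination partition, record) pairs it already holds.
    """
    store = {}  # stands in for shared storage such as S3: key -> records
    for cp_id, tuples in local_tuples.items():
        # Group the local tuples by destination partition.
        grouped = defaultdict(list)
        for dest, record in tuples:
            grouped[dest].append(record)
        # Write one file per destination partition, prefixed by cp_id.
        for dest, records in grouped.items():
            key = object_key(dest, cp_id, 0)
            if key in store:
                raise ValueError(f"conflicting key: {key}")
            store[key] = records
    return store


# Compute partitions 0 and 1 both hold tuples for year=2024.
local_tuples = {
    0: [("year=2023", {"id": 1}), ("year=2024", {"id": 2})],
    1: [("year=2024", {"id": 3}), ("year=2024", {"id": 4})],
}
store = write_without_repartitioning(local_tuples)
for key in sorted(store):
    print(key, store[key])
```

Both compute partitions write under `year=2024/`, yet their keys differ in the prefix, so neither overwrites the other.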

The proposed writing scheme is less sensitive to skew: each compute partition writes the tuples it already holds, so every compute partition has roughly the same number of tuples to write (this depends, of course, on the source query).
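The skew argument can be made concrete with a small Python sketch that counts how many tuples each writer handles under the two schemes. The function names and the round-robin assignment of destination partitions to writers are assumptions for illustration; they stand in for whatever partitioning the engine actually applies.

```python
def load_with_repartitioning(local_tuples, num_writers):
    """Tuples per writer when each destination partition has one owner."""
    dests = sorted({d for tuples in local_tuples.values() for d in tuples})
    # Hypothetical assignment: destination partitions are dealt out round-robin.
    owner = {d: i % num_writers for i, d in enumerate(dests)}
    load = {w: 0 for w in range(num_writers)}
    for tuples in local_tuples.values():
        for d in tuples:
            load[owner[d]] += 1
    return load


def load_without_repartitioning(local_tuples):
    """Tuples per writer when each compute partition writes what it holds."""
    return {cp: len(tuples) for cp, tuples in local_tuples.items()}


# Four compute partitions, each holding 90 tuples for a "hot" destination
# partition and 10 for a "cold" one.
local = {cp: ["hot"] * 90 + ["cold"] * 10 for cp in range(4)}

print(load_with_repartitioning(local, 4))   # {0: 40, 1: 360, 2: 0, 3: 0}
print(load_without_repartitioning(local))   # {0: 100, 1: 100, 2: 100, 3: 100}
```

In this toy input the source data is evenly spread, so writing locally keeps every writer at 100 tuples, while repartitioning funnels all 360 "hot" tuples to a single writer.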

Analytics currently fails when writing to a non-empty destination partition. That restriction must be lifted before this scheme can be adopted.

Attachments


Activity

People

Assignee: Wail Alkowaileet

Reporter: Wail Alkowaileet

Votes: 0

Watchers: 2

Dates

Created: 26/Feb/24 2:24 PM

Updated: 02/Apr/24 10:47 AM

Gerrit Reviews

There are no open Gerrit changes

Avoid repartitioning in COPY TO if no ORDER BY is specified
