Patched DF 49.0.1 (take 1) #72
Draft
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Tracking issue: https://github.com/influxdata/influxdb_iox/issues/14924
Patches
Patches map to commits 1:1 (i.e. every patch is exactly 1 commit) and are ordered for easier correlation of the description and the respective commits. They are also grouped in 3 stages.
A: Dummy
No actual patches, can be dropped at any point:
B: CI Fixes
Need to get CI up and running before picking any actual patches:
None.
All commits afterwards should build cleanly!
C: Patches
These are the actual relevant patches:
chore: default=true for skip_physical_aggregate_schema_check, and add warn logging
:until we chase down all warnings in our iox logs (see https://github.com/influxdata/influxdb_iox/issues/12404 )
fix: temporary fix to handle incorrect coalesce (inserted during EnforceDistribution) which later causes an error during EnforceSort (without our patch). The next DataFusion version 46 upgrade does the proper fix, which is to not insert the coalesce in the first place.
:There is EAR-5822 (also see https://github.com/influxdata/influxdb_iox/issues/13310 ) despite what the note in Patched DataFusion version
45.0.0
#54 and ParallelizeSorts, a subrule of EnforceSorting optimizer, should not remove necessary coalesce. apache/datafusion#14691 (comment) say, this is still required for DF version 46. Otherwise the regression test fails. Also see this slack thread.fix(build-wasm): put arrow-ipc/zstd dep under compression feature flag
:That's fix(build-wasm): put
arrow-ipc/zstd
dep undercompression
feature apache/datafusion#16844. I need this for https://github.com/influxdata/datafusion-udf-wasm . Can be dropped with DF 50.Support centroids config for approx_percentile_cont_with_weight
:That's Support
centroids
config forapprox_percentile_cont_with_weight
apache/datafusion#17003 . Needed so that the next patch applies cleanly. Can be dropped with DF 50.(Re)Support old syntax for approx_percentile_cont and approx_percentile_cont_with_weight
:That's (Re)Support old syntax for
approx_percentile_cont
andapprox_percentile_cont_with_weight
apache/datafusion#16999 . Can be dropped with DF 50.feat: support distinct for window
That's feat: support distinct for window apache/datafusion#16925 because a customer wants it (see https://github.com/influxdata/EAR/issues/6252 ). Can be dropped with DF 50.