Open
Labels
enhancement: New feature or request
Description
Is your feature request related to a problem or challenge?
Tracking ticket for next release, also a place to track desired inclusions
The previous release is #16799 (Sep 2025), so the next major release would be approximately late Nov 2025.
Steps:
- Create a `branch-51` branch:
- Create blog post
- Update version and changelog:
- Test with DataFusion Python
- Test with DataFusion Comet
- Test with delta.rs:
- Test vortex:
- Test with iceberg-rust:
- Test with LakeSail:
- Test with parquet viewer
- Test with datafusion-materialized-views
- Voting Thread:
- Publish to crates.io
- Create ticket for next release:
TODOs
Prior release tickets:
Related
- Boston USA Nov 2025 meetup: DISCUSSION: DataFusion Meetup in Boston, USA - Nov 12, 2025 #16703
Features to mention in the blog (if they make it)
- Support Decimal32/64 types #17501 from @AdamGS
- Support `WHERE`, `ORDER BY`, `LIMIT`, `SELECT`, `EXTEND` pipe operators #17278 (comment)
- Reduce number of object store requests when reading parquet files by default (set `metadata_size_hint`) #18118
- [EPIC] A collection of items to improve CASE performance #18075
- feat: change Expr OuterReferenceColumn and Alias to Box type for reducing expr struct size #16771
- Examples of extending SQL syntax #17824 from @theirix
Issues needed
Bugs that need to be fixed
- Restore window sort optimizations without exponential planning time #17624
- Exponential planning time when window function is partitioned by multiple columns #17401
- Non-constant `DATE_TRUNC` expression regression for values before epoch #18334
- Optimizer 'common_sub_expression_eliminate' fails in a window function (SQLStorm) #17770 / fix: optimizer `common_sub_expression_eliminate` fails in a window function #17852
- CI is failing on main / `current_date() = cast(now() as date);` #18062
- NowFunc::new() seeds the timezone with "+00" while the canonical UTC offset remains "+00:00" in ConfigOptions::default(). #18219
- to_timestamp(double) gives different results depending on scalar/vectorized call context #16678
- Regression: `sql_planner` benchmark panic'ing on main #17801
- Respect execution timezone in `to_timestamp` and related functions #18025
Community Wishlist
- Streaming Aggregate operator not being used in deduplication of pre-sorted Parquet files #16919
- Regression: `DataFrameWriteOptions::with_single_file_output` produces a directory #13323
- fix(parquet): converting parquet schema with backward compatible repeated struct/primitive with provided arrow schema arrow-rs#8496