feat: Deduplicating recursive CTE implementation #18254
Conversation
Rely on the aggregate GroupValues abstraction to build a hash table of the emitted rows that is used to deduplicate. We might make things a bit more efficient by writing a hash table wrapper just for deduplication, but this implementation should give a fair baseline.
```sql
query I
WITH RECURSIVE nodes AS (
    SELECT 1 as id
    UNION ALL
```
I think this was meant to be UNION.
And the original test wasn't generating duplicate results with UNION ALL.
Maybe we should use a different query. I could think of something like this, but I believe there should be a better one:
```sql
WITH RECURSIVE nodes AS (
    SELECT id from (VALUES (1), (2)) nodes(id)
    UNION
    SELECT id + 1 as id
    FROM nodes
    WHERE id < 4
)
SELECT * FROM nodes;
```
Thank you for spotting this! Your query is much better indeed. Added.
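For intuition, the fix-point behaviour of this UNION query can be simulated outside of DataFusion. The following is a minimal Rust sketch with plain collections standing in for RecordBatches and the WorkTable; the function name and structure are hypothetical, not the PR's actual code:

```rust
use std::collections::BTreeSet;

// Simulates:
//   WITH RECURSIVE nodes AS (
//       SELECT id from (VALUES (1), (2)) nodes(id)
//       UNION
//       SELECT id + 1 as id FROM nodes WHERE id < 4
//   ) SELECT * FROM nodes;
fn recursive_union() -> Vec<i32> {
    // The static term seeds both the result set and the work table.
    let mut seen: BTreeSet<i32> = BTreeSet::new();
    let mut worktable: Vec<i32> = vec![1, 2];
    for &id in &worktable {
        seen.insert(id);
    }
    // The recursive term runs until the work table is empty (the fix point).
    while !worktable.is_empty() {
        let mut next = Vec::new();
        for id in worktable {
            if id < 4 {
                // UNION semantics: only never-before-seen rows survive
                // deduplication and feed the next iteration, which is
                // what guarantees termination here.
                if seen.insert(id + 1) {
                    next.push(id + 1);
                }
            }
        }
        worktable = next;
    }
    seen.into_iter().collect()
}
```

With deduplication in place, the query converges to the rows 1 through 4, each emitted exactly once.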
From my perspective this is a very nice and concise solution to the problem.
Furthermore, from my understanding this should also correctly terminate the recursion, as only unique rows are pushed into the WorkTable and at some point (as can be seen in the closure example) this will reach a fix point.
What I am also thinking about is test coverage. My gut feeling says there should be some test cases in the SQLite test suite that cover distinct recursion. Would this cause the extended test suite to fail? Ideally, this solution passes all these test cases now! 🥳 However, I am a bit unsure how this is set up currently.
Thank you!
CAVEAT: I am by no means a DataFusion (nor recursive query) expert, so take my comments with a grain of salt.
```rust
    mut batch: RecordBatch,
) -> Poll<Option<Result<RecordBatch>>> {
    let baseline_metrics = self.baseline_metrics.clone();
    if let Some(deduplicator) = &mut self.distinct_deduplicator {
```
Thanks for adding metrics!
I think we could also move the metrics part to <RecursiveQueryStream as Stream>::poll_next as there is already a TODO for doing so. I believe it would be fine to update the time metrics there even if there is no deduplication going on but there might be different opinions.
Anyways, I think this is an improvement over the status quo.
Thank you for prompting me on this! I removed the TODOs but have not moved the metric code, to avoid duplicating it (once for the static stream and once for the recursive stream).
```rust
}

/// Return a mask, each element true if the value is greater than all
/// previous ones and greater than or equal to `min_value`
fn are_increasing_mask(values: &[usize], mut min_value: usize) -> BooleanArray {
```
I think I understood what this function does, but I had a hard time with min_value. Maybe we can be more explicit here. Just some suggestions:

input parameter: min_value -> highest_group_id

// Always update the min_value to do de-duplication within a record batch.
let mut min_value = highest_group_id;

Maybe integrating the comment into the doc comment for are_increasing_mask is also more than enough.
I think this assumes that the group ids are assigned in order within the record batch, but I think this is a valid assumption. Maybe someone more familiar with the aggregation infrastructure has more information on that.
> I think this assumes that the group ids are assigned in-order within the record batch

Yes, this is part of the GroupValues trait documentation.
I have rephrased the doc comment. I hope it's clearer now.
I have not renamed min_value to highest_group_id; the function does not depend on any specific semantics beyond creating the mask from its inputs. But I'm happy to do the rename if you feel strongly about it.
That's perfectly fine. Just a suggestion 👍
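To make the min_value contract discussed above concrete, here is a hypothetical re-implementation of the masking logic using a plain Vec&lt;bool&gt; instead of Arrow's BooleanArray. This is my reading of the doc comment, not the PR's actual code:

```rust
/// Returns a mask that is true for each value that is greater than all
/// previous values in the slice AND greater than or equal to `min_value`.
/// Because group ids are assigned in order, a `true` marks the first
/// occurrence of a row (a brand-new group id) within the batch.
fn are_increasing_mask(values: &[usize], mut min_value: usize) -> Vec<bool> {
    let mut mask = Vec::with_capacity(values.len());
    for &v in values {
        if v >= min_value {
            mask.push(true);
            // Bump the threshold so a later repeat of `v` (or any smaller
            // group id) within the same batch is masked out as a duplicate.
            min_value = v + 1;
        } else {
            mask.push(false);
        }
    }
    mask
}
```

Passing the highest previously allocated group id as min_value filters rows already seen in earlier batches, while the running threshold handles duplicates within the current batch.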
```rust
}

fn deduplicate(&mut self, batch: &RecordBatch) -> Result<RecordBatch> {
    // We use the hash table to allocate new group ids.
```
I think we can make a version of that comment the doc comment for DistinctDeduplicator::deduplicate
Indeed. I have moved the comment as a doc comment and rephrased it to be hopefully clearer.
Which issue does this PR close?

UNION in recursive CTE #18140.

Rationale for this change

Implements deduplicating recursive CTE (i.e. UNION inside of WITH RECURSIVE) using a hash table. I reuse the one from aggregates to avoid rebuilding a full wrapper and specialization for types. Each time a batch is returned by the static or the recursive term of the CTE, the hash table is used to remove already-seen rows before emitting them and keeping them in memory for the next recursion step.

What changes are included in this PR?

Reusing GroupValues trait implementations inside of RecursiveQueryExec to get deduplication working.

Are these changes tested?

Yes, some sqllogictests have been added, including ones that would lead to infinite recursion if deduplication were disabled.
Are there any user-facing changes?
No
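The per-batch flow the description outlines can be sketched with a plain HashSet standing in for the GroupValues hash table. The helper below is hypothetical (rows modeled as byte vectors rather than RecordBatch columns), meant only to illustrate the filtering step:

```rust
use std::collections::HashSet;

/// Keeps only rows whose key has not been emitted in any earlier batch,
/// mimicking how each batch from the static or recursive term is filtered
/// before being emitted and fed back into the work table.
fn deduplicate_batch(
    seen: &mut HashSet<Vec<u8>>,
    batch: &[Vec<u8>],
) -> Vec<Vec<u8>> {
    batch
        .iter()
        // `insert` returns true only the first time a key is seen, so
        // duplicates both within this batch and across batches are dropped.
        .filter(|row| seen.insert((*row).clone()))
        .cloned()
        .collect()
}
```

Because every emitted row is retained in the table, a row re-derived in a later recursion step is filtered out, which is what makes the recursion reach its fix point.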