Skip to content

Conversation

@alamb
Copy link
Collaborator

@alamb alamb commented Sep 8, 2025

Note this targets #73 and merging this PR will update that PR

Rationale (why)

@stuartcarnie is working on a PachaTree related feature (see this slack thread) that involves merge join of binary columns

In order to use this feature, we need to backport the change from apache#17431 to DataFusion 49.0.2 on our fork

Changes (what):

  1. git cherry-pick 7b65c5b (cherry pick 7b65c5b from DataFusion) to this branch
  2. resolve conflicts

…he#17431)

* feat: Support binary data types for `SortMergeJoin` `on` clause

* Add sql level tests for merge join on binary keys

---------

Co-authored-by: Andrew Lamb <[email protected]>
DataType::Utf8 => compare_value!(StringArray),
DataType::Utf8View => compare_value!(StringViewArray),
DataType::LargeUtf8 => compare_value!(LargeStringArray),
DataType::Binary => compare_value!(BinaryArray),
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The code changes are very small, most of this PR is tests

@crepererum
Copy link

Thanks. I've manually cherry-picked the commit since I don't trust GH to not mess up my linear history.

@crepererum crepererum closed this Sep 9, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants