Update `variant_integration` test to use final approved `parquet-testing` data #8325

alamb · 2025-09-11T21:03:49Z

Which issue does this PR close?

We generally require a GitHub issue to be filed for all bug fixes and enhancements and this helps us generate change logs for our releases. You can link an issue to this PR using the GitHub syntax.

Closes [Variant] Integration tests for reading parquet w/ Variants #8084

Rationale for this change

Now that we have merged the upstream parquet-variant tests:

Add shredded Variant reader cases with variant logical type parquet-testing#91

We can test how far we are from the rust variant implementation working for all the values

This PR updates the test harness added #8104 by @carpecodeum to use the final parquet files and the currnet APIs

What changes are included in this PR?

Update parquet-testing pin
Update the test harness to use the standard rust test runner (#[test]) rather than a custom main function
Added links to follow on tickets

You can run this test manually like this:

cargo test --all-features --test variant_integration

...
running 138 tests
test test_variant_integration_case_106 ... ok
test test_variant_integration_case_107 ... ok
test test_variant_integration_case_109 ... ok
test test_variant_integration_case_110 ... ok
..
test test_variant_integration_case_90 ... ok
test test_variant_integration_case_91 ... ok
test test_variant_integration_case_93 ... ok
test test_variant_integration_case_83 - should panic ... ok
test test_variant_integration_case_84 - should panic ... ok

Are these changes tested?

Yes this is all tests

Are there any user-facing changes?

No

…pport

alamb · 2025-09-12T11:25:18Z

parquet-variant/src/variant/metadata.rs

+    /// Note this value may be smaller than what was passed to [`Self::new`] or
+    /// [`Self::try_new`] if the input was larger than necessary to encode the
+    /// metadata dictionary.
+    pub fn size(&self) -> usize {


I needed to expose this information because the variant metadata / data are appended in one .bin file in the test cases

alamb · 2025-09-12T11:26:09Z

parquet/tests/variant_integration.rs


-/// Test case definition structure matching the format from cases.json
-#[derive(Debug, Clone)]
+// Generate test functions for each case


I rewrote this file so the tests use the existing Rust test harness rather than our own. It does require some redundancy, but it means normal rust test execution tools work

alamb · 2025-09-12T11:26:30Z

parquet/tests/variant_integration.rs

+// - cases 40, 42, 87, 127 and 128 are expected to fail always (they include invalid variants)
+// - the remaining cases are expected to (eventually) pass
+
+variant_test_case!(1, "Unsupported typed_value type: List(");


I am quite pleased how many cases pass, actually

alamb · 2025-09-12T11:27:14Z

@carpecodeum or @mprammer do you have time to review this PR?

alamb · 2025-09-12T11:27:26Z

Also FYI @scovich @codephage2020 @liamzwbao and @klion26

alamb · 2025-09-12T11:29:08Z

@scovich one thing I have been thinking about is if there is some way to leverage this same test suite for variant_get

At the moment the tests read a shredded variant out as an unshredded varant and compares it

I was thinking maybe I could extend this more to then re-shred the variant and compare it with the original shredded one 🤔

klion26

LGTM

alamb · 2025-09-15T19:20:03Z

Thanks for the review @klion26

I'll plan to merge this tomorrow unless anyone else would like additional time to review

codephage2020

Looks Good To Me! Thanks for the clean implementation.

alamb · 2025-09-16T18:39:31Z

The test is running more and more cases before I can even merge it!

alamb · 2025-09-16T19:13:04Z

😅 -- testing for the win!

alamb added 2 commits September 11, 2025 13:50

Update parquet-testing pin to master

fca95e6

Update variant_integration tests to read and use the new shredding su…

a2ce848

…pport

github-actions bot added parquet Changes to the parquet crate parquet-variant parquet-variant* crates labels Sep 11, 2025

alamb changed the title ~~Alamb/variant tests~~ Update variant_integration to use final series Sep 11, 2025

alamb added 2 commits September 12, 2025 06:50

fix docs

136c2d8

clarify comments

675f88b

Add issue links

60d9425

alamb marked this pull request as ready for review September 12, 2025 11:18

alamb mentioned this pull request Sep 12, 2025

[EPIC] [Parquet] Implement Variant type support in Parquet #6736

Open

alamb commented Sep 12, 2025

View reviewed changes

alamb changed the title ~~Update variant_integration to use final series~~ Update variant_integration to use final approved parquet-testing data Sep 12, 2025

alamb changed the title ~~Update variant_integration to use final approved parquet-testing data~~ Update variant_integration test to use final approved parquet-testing data Sep 12, 2025

klion26 approved these changes Sep 15, 2025

View reviewed changes

Merge remote-tracking branch 'apache/main' into alamb/variant_tests

5c6c436

Update tests

94a9f2d

codephage2020 approved these changes Sep 16, 2025

View reviewed changes

mbrobbel approved these changes Sep 16, 2025

View reviewed changes

alamb added 4 commits September 16, 2025 10:54

Merge branch 'main' into alamb/variant_tests

ee7614c

Merge remote-tracking branch 'apache/main' into alamb/variant_tests

f8693ac

Return Variant::Uuid for FixedSizeBianry of 16 when possible

2f4aacf

clippy

cdb8943

alamb closed this Sep 16, 2025

alamb reopened this Sep 16, 2025

alamb merged commit 2ec77b5 into apache:main Sep 16, 2025
39 checks passed

alamb deleted the alamb/variant_tests branch September 16, 2025 19:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update `variant_integration` test to use final approved `parquet-testing` data #8325

Update `variant_integration` test to use final approved `parquet-testing` data #8325

Uh oh!

alamb commented Sep 11, 2025 •

edited

Loading

Uh oh!

alamb Sep 12, 2025

Uh oh!

alamb Sep 12, 2025

Uh oh!

alamb Sep 12, 2025

Uh oh!

alamb commented Sep 12, 2025

Uh oh!

alamb commented Sep 12, 2025

Uh oh!

alamb commented Sep 12, 2025

Uh oh!

klion26 left a comment

Uh oh!

alamb commented Sep 15, 2025 •

edited

Loading

Uh oh!

codephage2020 left a comment

Uh oh!

alamb commented Sep 16, 2025

Uh oh!

Uh oh!

alamb commented Sep 16, 2025

Uh oh!

Uh oh!

Update variant_integration test to use final approved parquet-testing data #8325

Update variant_integration test to use final approved parquet-testing data #8325

Uh oh!

Conversation

alamb commented Sep 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

alamb Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

alamb Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

alamb Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

alamb commented Sep 12, 2025

Uh oh!

alamb commented Sep 12, 2025

Uh oh!

alamb commented Sep 12, 2025

Uh oh!

klion26 left a comment

Choose a reason for hiding this comment

Uh oh!

alamb commented Sep 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codephage2020 left a comment

Choose a reason for hiding this comment

Uh oh!

alamb commented Sep 16, 2025

Uh oh!

Uh oh!

alamb commented Sep 16, 2025

Uh oh!

Uh oh!

Update `variant_integration` test to use final approved `parquet-testing` data #8325

Update `variant_integration` test to use final approved `parquet-testing` data #8325

alamb commented Sep 11, 2025 •

edited

Loading

alamb commented Sep 15, 2025 •

edited

Loading