[Variant] Allow lossless casting from integer to floating point #8357

scovich · 2025-09-16T09:19:49Z

Which issue does this PR close?

We generally require a GitHub issue to be filed for all bug fixes and enhancements and this helps us generate change logs for our releases. You can link an issue to this PR using the GitHub syntax.

Closes #NNN.

Rationale for this change

Historically, Variant::as_fXX methods don't even try to cast int values as floating point, which is counter-intuitive.

What changes are included in this PR?

Allow lossless casting of variant integer values to variant floating point values, by a naive determination of precision:

Every floating point number has some number of bits of precision
- 53 (double)
- 24 (single)
- 11 (half)
Any integer that fits entirely inside the target floating point type's precision can be converted losslessly
- This produces an intuitive result: "too big" numbers fail to convert, while "small enough" numbers do convert.
- This is a sufficient but not a necessary condition.
- Technically, wider integer can be represented losslessly as well, as long as they have enough trailing zeros
- It's unclear whether allowing those wider values to cast is actually helpful in practice, because only 1 in 2**k values can cast (where k is the number of bits of excess precision); it would certainly make input testing more expensive.

Are these changes tested?

New unit tests and doc tests.

Are there any user-facing changes?

Yes. Values that failed to cast before now succeed.

scovich · 2025-09-16T09:20:32Z

CC @klion26 @alamb -- should be a quickie

alamb

make sense to me -- thanks @scovich

klion26

LGTM, thanks for the improvement.

klion26 · 2025-09-17T02:47:07Z

parquet-variant/src/utils.rs

@@ -144,3 +144,20 @@ pub(crate) const fn expect_size_of<T>(expected: usize) {
        let _ = [""; 0][size];
    }
 }
+
+pub(crate) fn fits_precision<const N: u32>(n: impl Into<i64>) -> bool {


Put N into the generic, not the parameter, because this can yield better performance, do I understand correctly?

It was mostly out of habit for small integer manipulation utilities... But now that you mention, it probably doesn't matter in the slightest -- the compiler will anyway inline aggressively and the constant arg will be folded in regardless of whether it's a generic arg or a function arg.

That said, there's one potential advantage to keeping the generic arg: Otherwise, it could be ambiguous which of two integer args is the precision and which is the actual value. Any preferences?

Thanks for the detailed reply, I'm fine with the current implementation

klion26 · 2025-09-17T02:48:58Z

parquet-variant/src/variant.rs

@@ -1096,13 +1096,21 @@ impl<'m, 'v> Variant<'m, 'v> {
    /// let v2 = Variant::from(std::f64::consts::PI);
    /// assert_eq!(v2.as_f16(), Some(f16::from_f64(std::f64::consts::PI)));
    ///
+    /// // and from integers with no more than 11 bits of precision


Not sure if we need to add an overflow example here(e.g, Variant::from(2048) for f16)

The unit test for fits_precision does test overflow, both positive and negative.
Do we need additional (indirect) test coverage here?

Maybe we need to update the doc(the comment said that -- Returns Some(f16) for float and double variants, None for non-floating-point variants) or add an example for integer overflow (`Variant::from(2048) for f16 -- will return None)

I don't have a strong preference here, it's just an idea that popped into my mind when I saw this

Oh! Good catch on the doc comment. Updated all three.

alamb · 2025-09-17T15:24:48Z

🚀

alamb · 2025-09-17T15:24:55Z

Thanks @scovich and @klion26

[Variant] Allow lossless casting from integer to floating point

03a674d

github-actions bot added the parquet-variant parquet-variant* crates label Sep 16, 2025

use i64:BITS

afff124

alamb approved these changes Sep 16, 2025

View reviewed changes

klion26 approved these changes Sep 17, 2025

View reviewed changes

scovich added 2 commits September 16, 2025 21:16

Merge remote-tracking branch 'oss/main' into variant-int-to-float-casts

1683803

review feedback

5d9d837

scovich mentioned this pull request Sep 17, 2025

[Variant] Define new shred_variant function #8366

Open

alamb merged commit d6f40ce into apache:main Sep 17, 2025
12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Variant] Allow lossless casting from integer to floating point #8357

[Variant] Allow lossless casting from integer to floating point #8357

scovich commented Sep 16, 2025

Uh oh!

scovich commented Sep 16, 2025

Uh oh!

alamb left a comment

Uh oh!

klion26 left a comment

Uh oh!

klion26 Sep 17, 2025

Uh oh!

scovich Sep 17, 2025

Uh oh!

klion26 Sep 17, 2025

Uh oh!

klion26 Sep 17, 2025

Uh oh!

scovich Sep 17, 2025

Uh oh!

klion26 Sep 17, 2025

Uh oh!

scovich Sep 17, 2025 •

edited

Loading

Uh oh!

alamb commented Sep 17, 2025

Uh oh!

Uh oh!

alamb commented Sep 17, 2025

Uh oh!

Uh oh!

[Variant] Allow lossless casting from integer to floating point #8357

[Variant] Allow lossless casting from integer to floating point #8357

Conversation

scovich commented Sep 16, 2025

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

scovich commented Sep 16, 2025

Uh oh!

alamb left a comment

Choose a reason for hiding this comment

Uh oh!

klion26 left a comment

Choose a reason for hiding this comment

Uh oh!

klion26 Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

scovich Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

klion26 Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

klion26 Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

scovich Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

klion26 Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

scovich Sep 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alamb commented Sep 17, 2025

Uh oh!

Uh oh!

alamb commented Sep 17, 2025

Uh oh!

Uh oh!

scovich Sep 17, 2025 •

edited

Loading