Fixes several issues with FP exceptions, like domain errors, div-by-zero, NaN, etc. #2093

stolk · 2025-09-25T16:40:31Z

This MR adds a toggle to run the vbe_test program with FP exceptions enabled.

This MR also fixes issues #2090 #2091 #2092 that were found with this facility:

FP overflow by multiplying FLT_MAX

Division by zero

Square roots of negative values.

linux-foundation-easycla · 2025-09-25T16:40:38Z

The committers listed above are authorized under a signed CLA.

✅ login: stolk / name: Bram Stolk (495fda9, 5190308, 60ef741, bc381ad)

openvdb/openvdb/tools/Diagnostics.h

openvdb/openvdb/unittest/main.cc

Idclip · 2025-09-26T23:30:30Z

openvdb/openvdb/unittest/main.cc

    ::testing::InitGoogleTest(&argc, argv);
+
+#if defined(__linux__) && defined(OPENVDB_TESTS_FPE)
+    feenableexcept(FE_DIVBYZERO | FE_INVALID | FE_OVERFLOW);


I wonder if we really want FE_OVERFLOW on by default. It's perfectly valid if it's handled and my understanding is it will fire regardless. For example, the change in Diagnostics.h:

maxVal2(_max < std::numeric_limits<ValueType>::max() ? _max*_max : _max)

can still overflow if _max is large. A better solution would be:

maxVal2([&](){ auto m2 = _max*_max; return std::isfinite(m2) ? m2 : std::numeric_limits<T>::max(); }())

But this will always trigger an overflow, even though the program is well formed. I guess it might still be useful to identify these cases - but as it causes an immediate abort it's probably not super useful.

Issue hit with FE_DIVBYZERO and FE_INVALID should (usually) always be avoided, so those definitely make sense.

First off: FLT_MAX is a special value to denote a lack of bounds, so will frequently be encountered. Values smaller than that, but larger than the sqrt(FLT_MAX) will not occur in practice.

They are either small, or they are FLT_MAX, nothing in between.

Hence the solution used here makes sense: special treatment of FLT_MAX, because it is a special case, used to signal something.

Second, I disagree on your comment on FE_OVERFLOW. It is important to catch these in unit tests.

NOTE: This change does not enable exceptions for production code, nor for applications, not for vdb_view, vdb_render, etc.

This change only (optionally) affects vdb_test, which does need to catch these cases.

If you want to make the case that FLT_MAX is special that's fine, but that is not explicitly stated for many of our tools - and saying values are either small or FLT_MAX is a huge generalization.

Yes I'm aware this only impacts the tests. There are lots of places where we check for finite and branch accordingly. So having it fire in these situations will be annoying (assuming our tests actually hit these paths).

I think adding just FE_DIVBYZERO and FE_INVALID would be a good start. That's better than what we have right now. By yeah, making it default to OFF for the unit tests would be my vote.

I agree that there are cases where we should allow overflow/underflow to happen. That's something we can guard against so that we can still globally set these flags:

fenv_t env; feholdexcept(&env); // operation that can generate overflow fesetenv(&env);

I'm not sure it's worth maintaining that level of strictness though.

To be clear, I'm fine leaving this in so long as all our tests currently pass with it enabled. I'm not a fan of arbitrarily treating FLT_MAX as a special value in places where input bounds aren't well defined - but if your change is simply to achieve the prior then for this PR it makes sense

The tests will pass because VDB_TESTS_FPE is off by default.

This is a (very useful) tool for developers that want it. It finds real bugs.

Also, on breaking tests if enabled... I really can't see that as a negative: Breaking unit tests is good. It finds bugs. Having tests pass is not the goal: having correct code is the goal.

It forces a developer to tighten up the use of floating point values. It finds invalid assumptions made by the developer.

Code that produces FP overflows is suspect, and should be flagged: 3.402823466e+38F is a really big number. If your code exceeds it, you probably did something wrong. Just like the case it found, where an 'unbounded' special value was squared, e.g.

I have removed FE_OVERFLOW so that it no longer fires by default. It will only conditionally fire if you toggle FP exceptions on via cmake, and then set the env var OPENVDB_TEST_OVERFLOW at runtime. If not set, it will only fire on FP_INVALID and FP_DIVBYZERO thereby allowing code to overflow float values.

stolk · 2025-10-11T03:55:23Z

Can someone please approve workflows?

Also, I've addressed the MR feedback, I think this is ready to get merged.

stolk · 2025-11-06T21:46:14Z

@danrbailey @Idclip @kmuseth

Hello, this MR solves real bugs, and I have revised the MR according to feedback.

Can one of you please approve the MR?

Thanks.

openvdb/openvdb/unittest/CMakeLists.txt

danrbailey · 2025-09-27T00:08:27Z

openvdb/openvdb/unittest/main.cc

    ::testing::InitGoogleTest(&argc, argv);
+
+#if defined(__linux__) && defined(OPENVDB_TESTS_FPE)
+    feenableexcept(FE_DIVBYZERO | FE_INVALID | FE_OVERFLOW);


I think adding just FE_DIVBYZERO and FE_INVALID would be a good start. That's better than what we have right now. By yeah, making it default to OFF for the unit tests would be my vote.

I agree that there are cases where we should allow overflow/underflow to happen. That's something we can guard against so that we can still globally set these flags:

fenv_t env; feholdexcept(&env); // operation that can generate overflow fesetenv(&env);

I'm not sure it's worth maintaining that level of strictness though.

openvdb/openvdb/tools/Diagnostics.h

To catch FP exceptions, build as: $ cmake -D OPENVDB_TESTS_FPE=ON ... And to get callstacks in debugger, use: $ gdb --args ./openvdb/openvdb/unittest/vdb_test --gtest_catch_exceptions=0 This toggle will let you reproduce issues AcademySoftwareFoundation#2090 AcademySoftwareFoundation#2091 AcademySoftwareFoundation#2092 Signed-off-by: Bram Stolk <[email protected]>

FIXES AcademySoftwareFoundation#2091 Signed-off-by: Bram Stolk <[email protected]>

FP exceptions were triggered in unit tests. FIXES AcademySoftwareFoundation#2092 Signed-off-by: Bram Stolk <[email protected]>

Signed-off-by: Bram Stolk <[email protected]>

stolk · 2025-11-25T19:38:05Z

Ping

stolk requested review from Idclip, danrbailey and kmuseth as code owners September 25, 2025 16:40

danrbailey reviewed Sep 25, 2025

View reviewed changes

openvdb/openvdb/tools/Diagnostics.h Outdated Show resolved Hide resolved

openvdb/openvdb/unittest/main.cc Outdated Show resolved Hide resolved

stolk force-pushed the fpe branch from 4c2a8aa to 1b837eb Compare September 25, 2025 19:08

Idclip reviewed Sep 26, 2025

View reviewed changes

stolk force-pushed the fpe branch 2 times, most recently from 5d72ffd to df7aa5d Compare October 11, 2025 03:22

stolk force-pushed the fpe branch from 25c6c94 to 832840a Compare October 15, 2025 23:52

danrbailey reviewed Nov 8, 2025

View reviewed changes

stolk added 4 commits November 11, 2025 15:01

Fix division by zero causing FP exception during unit test run.

495fda9

FIXES AcademySoftwareFoundation#2091 Signed-off-by: Bram Stolk <[email protected]>

Do not take square root of negative values.

5190308

FP exceptions were triggered in unit tests. FIXES AcademySoftwareFoundation#2092 Signed-off-by: Bram Stolk <[email protected]>

Add env var for FE_OVERFLOW.

bc381ad

Signed-off-by: Bram Stolk <[email protected]>

stolk force-pushed the fpe branch from 832840a to bc381ad Compare November 11, 2025 23:05

stolk changed the title ~~Fixes several issues with FP exceptions, like overflows, NaN, etc.~~ Fixes several issues with FP exceptions, like domain errors, div-by-zero, NaN, etc. Nov 11, 2025

Fixes several issues with FP exceptions, like domain errors, div-by-zero, NaN, etc. #2093

Are you sure you want to change the base?

Fixes several issues with FP exceptions, like domain errors, div-by-zero, NaN, etc. #2093

Uh oh!

Conversation

stolk commented Sep 25, 2025

Uh oh!

linux-foundation-easycla bot commented Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

stolk commented Oct 11, 2025

Uh oh!

stolk commented Nov 6, 2025

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

stolk commented Nov 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

linux-foundation-easycla bot commented Sep 25, 2025 •

edited

Loading