Add dot(), dot3(), and dot4() methods to SimdFloat and tests to existing dot product functions #491

GrigoryEvko · 2025-11-16T20:07:21Z

Quality of life improvement for dot product operations. Entirely optional - feel free to reject if this adds
unnecessary API surface.

Methods:

dot(rhs) - Full dot product (alias for (self * rhs).reduce_sum())
dot3(rhs) - 3D dot product (first 3 elements, w component ignored)
dot4(rhs) - 4D dot product (first 4 elements only)

Rationale:

dot3()/dot4() reduce boilerplate in graphics/physics code
dot() provides discoverability and matches industry libraries (NumPy, GLM, Eigen)
All methods are inline, zero-cost abstractions
26 comprehensive tests included

Implementation:

dot() delegates to (self * rhs).reduce_sum()
dot3()/dot4() use compile-time assertions for size safety
Zero allocations, pure SIMD register operations

Testing:

Mathematical properties (commutativity, distributivity, scaling)
Special values (infinity, NaN propagation)
Size variations (f32x2, f32x4, f32x8, f64x2, f64x4, f64x8)
Edge cases (orthogonal vectors, zero vectors, negative values)

Happy to close if you feel it's not worth the API complexity.

Quality of life improvement for dot product operations. Entirely optional - feel free to reject if this adds unnecessary API surface. Methods: - dot(rhs) - Full dot product (sugar for (self * rhs).reduce_sum()) - dot3(rhs) - 3D dot product (first 3 elements, w component ignored) - dot4(rhs) - 4D dot product (first 4 elements only) Rationale: - dot3()/dot4() reduce boilerplate in graphics/physics code - dot() provides discoverability and matches industry libraries (NumPy, GLM, Eigen) - All methods are inline, zero-cost abstractions Implementation: - dot() delegates to (self * rhs).reduce_sum() - dot3()/dot4() use compile-time assertions for size safety - Zero allocations, pure SIMD register operations Testing (26 tests): - Mathematical properties (commutativity, distributivity, scaling) - Special values (infinity, NaN, MAX, MIN, subnormals) - Size variations (f32x2, f32x4, f32x8, f64x2, f64x4, f64x8) - Edge cases (orthogonal vectors, zero vectors, negative values) - w component correctly ignored in dot3() - Elements beyond index correctly ignored in dot3()/dot4() The decision to include this is completely up to maintainers. Happy to close if you feel it's not worth the API complexity. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

ARM NEON uses flush-to-zero (FTZ) for subnormal values in SIMD operations. Updated tests to accept either the correct subnormal result or zero.

GrigoryEvko and others added 3 commits November 16, 2025 23:03

Fix subnormal value tests for ARM NEON FTZ mode

dda2284

ARM NEON uses flush-to-zero (FTZ) for subnormal values in SIMD operations. Updated tests to accept either the correct subnormal result or zero.

Remove stable stdarch_x86_avx512 feature

995ad15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add dot(), dot3(), and dot4() methods to SimdFloat and tests to existing dot product functions #491

Add dot(), dot3(), and dot4() methods to SimdFloat and tests to existing dot product functions #491

Uh oh!

GrigoryEvko commented Nov 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add dot(), dot3(), and dot4() methods to SimdFloat and tests to existing dot product functions #491

Are you sure you want to change the base?

Add dot(), dot3(), and dot4() methods to SimdFloat and tests to existing dot product functions #491

Uh oh!

Conversation

GrigoryEvko commented Nov 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant