@gopinath-vasalamarri commented Dec 14, 2024

Remove the imul instruction from the murmur operations of the 32-bit hash functions. By my estimate this saves roughly 1-3 CPU cycles overall.

Before -- https://godbolt.org/z/ed93nf1sx
After -- https://godbolt.org/z/3h1rG399P

Remove the imul instruction from the murmur operations of the 32-bit hash functions. By my estimate this saves about 1 CPU cycle.

Before -- https://godbolt.org/z/GWvnEoPa3
After -- https://godbolt.org/z/4zEoM5G1W
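
For illustration only, here is a minimal sketch of the kind of rewrite that drops a multiply from a murmur mixing step. It assumes the change targets the `h1 * 5` term of the standard MurmurHash3 32-bit mixing step and replaces it with a shift and an add. That target, the class name `MixH1Sketch`, and both method names are hypothetical and not taken from this PR's diff. The sketch shows only that the two forms produce the same result in 32-bit arithmetic; it says nothing about speed.

```java
import java.util.Random;

// Hypothetical sketch: names and the choice of rewrite are assumptions, not the PR's code.
final class MixH1Sketch {

  // Standard MurmurHash3 32-bit mixing step, written with a multiply by 5.
  public static int mixH1Multiply(int h1, int k1) {
    h1 ^= k1;
    h1 = Integer.rotateLeft(h1, 13);
    return h1 * 5 + 0xe6546b64;
  }

  // Same step with the multiply expressed as (h1 << 2) + h1, i.e. 4*h1 + h1.
  // Java int arithmetic wraps modulo 2^32, so both forms agree for every input.
  public static int mixH1Shift(int h1, int k1) {
    h1 ^= k1;
    h1 = Integer.rotateLeft(h1, 13);
    return (h1 << 2) + h1 + 0xe6546b64;
  }

  public static void main(String[] args) {
    Random random = new Random(42); // fixed seed so the check is repeatable
    for (int i = 0; i < 1_000_000; i++) {
      int h1 = random.nextInt();
      int k1 = random.nextInt();
      if (mixH1Multiply(h1, k1) != mixH1Shift(h1, k1)) {
        throw new AssertionError("mismatch for h1=" + h1 + ", k1=" + k1);
      }
    }
    System.out.println("both forms agree on all sampled inputs");
  }
}
```

Because the two forms are arithmetically identical, a JIT or ahead-of-time compiler is free to emit the same machine code for both, so any real difference would have to be confirmed with a benchmark.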
@lowasser
Contributor

I see the difference in bytecode, but is there a difference in benchmarks? I'd tend to expect both versions to get optimized to the same thing.

@gopinath-vasalamarri
Author

Yeah, running a benchmark is a good idea. By any chance, do we have one for testing the performance of the strategies? If so, could you please point me to how to run it?

Thanks

@cgdecker cgdecker added type=performance Related to performance P3 no SLO package=hash labels Dec 20, 2024