NIXLBENCH: Add GPU tests to CI #780
Merged
52 commits
b10bce6 NIXLBENCH: Add GPU tests to CI (ovidiusm)
7a47b98 Remove redundant parameter (ovidiusm)
a7cdcde Use two workers in UCX tests (ovidiusm)
49305eb Remove CUDA compat paths (ovidiusm)
2c18242 Build UCX with CUDA support if possible (ovidiusm)
083dc6b Debug (ovidiusm)
62b94f0 Fix build (ovidiusm)
607fdfc Fix build (ovidiusm)
344d019 Cleanup (ovidiusm)
28d6601 Fix tests on non-GPU workers (ovidiusm)
e2ae443 Skip broken test (ovidiusm)
e099bfe Merge remote-tracking branch 'dynamo/main' into nixlbench-gpu-tests (ovidiusm)
2e234ca Silence telemetry errors (ovidiusm)
090b9c6 Merge remote-tracking branch 'dynamo/main' into nixlbench-gpu-tests (ovidiusm)
f08de65 Track registrations (ovidiusm)
af73eca Revert "Track registrations" (ovidiusm)
93d91fb Silence error (ovidiusm)
36b6fe7 Isolate gtest (ovidiusm)
f27977c Run gtest via gtest-parallel (ovidiusm)
0a102e5 Revert code removal (ovidiusm)
cbb5dbf Add timeout for build (ovidiusm)
35e75b9 Install gtest-parallel in Dockerfile (ovidiusm)
1656495 Cleanup scripts (ovidiusm)
10ba539 Cleanup unrelated changes (ovidiusm)
5e336c1 Use CUDA stubs when there is no GPU, to fix CI envs without GPU (ovidiusm)
25d2b7b Merge remote-tracking branch 'dynamo/main' into nixlbench-gpu-tests (ovidiusm)
842eea0 Try another way of loading CUDA libs on CI workers without GPUs (ovidiusm)
9aab08a Revert changes to CUDA load path (ovidiusm)
43e0d5a Check what happens with UCX CUDA auto-detection (ovidiusm)
4091b1e Try another way of setting lib path (ovidiusm)
00f5df0 Simplify (ovidiusm)
c2bbc2b Revert removal of UCX cuda option (ovidiusm)
ee12fe4 Move back the lib loading path (ovidiusm)
4e600b9 Fix SIGINT, add more workers (ovidiusm)
de81063 Merge remote-tracking branch 'dynamo/main' into nixlbench-gpu-tests (ovidiusm)
4993804 Use a single worker in tests (ovidiusm)
9b34995 Adjust number of parallel workers depending on environment (ovidiusm)
6d12cd1 Add etcd namespace isolation for unit tests (ovidiusm)
2e9703f Use a single gtest worker without GPU (ovidiusm)
a9ebaa5 Reduce the number of gtest workers to 1 (gitlab CI fails with paralle… (ovidiusm)
c3e6a28 Merge remote-tracking branch 'dynamo/main' into nixlbench-gpu-tests (ovidiusm)
b7bc101 Refactor scripts (ovidiusm)
78deae5 Attempt run without gtest-parallel (ovidiusm)
f3dd55d Revert "Attempt run without gtest-parallel" (ovidiusm)
ec3f080 Merge remote-tracking branch 'dynamo/main' into nixlbench-gpu-tests (ovidiusm)
8f4bb95 Workaround DGX issue (ovidiusm)
1c4b7e8 Merge remote-tracking branch 'dynamo/main' into nixlbench-gpu-tests (ovidiusm)
d0b3178 Fix for std::runtime_error exception on connection close (ovidiusm)
7ba49db Remove redundant tests (ovidiusm)
d60cbf4 Remove peer in place (ovidiusm)
154b398 Revert "Remove peer in place" (ovidiusm)
99749f5 Revert "Fix for std::runtime_error exception on connection close" (ovidiusm)