Commit 3b44649

tuned: use tree instead of bruck at scale
The switch from tree to bruck between 512 and 1023 processes leads to unexpected latency changes in benchmarks of other collectives. We should be consistent here. There is no good reason why bruck would perform better in that range but not beyond.

Signed-off-by: Joseph Schuchart <[email protected]>
1 parent 1270ced commit 3b44649

1 file changed, +1 -7 lines changed

ompi/mca/coll/tuned/coll_tuned_decision_fixed.c

Lines changed: 1 addition & 7 deletions
@@ -487,14 +487,8 @@ int ompi_coll_tuned_barrier_intra_dec_fixed(struct ompi_communicator_t *comm,
         alg = 3;
     } else if (communicator_size < 256) {
         alg = 4;
-    } else if (communicator_size < 512) {
-        alg = 6;
-    } else if (communicator_size < 1024) {
-        alg = 4;
-    } else if (communicator_size < 4096) {
-        alg = 6;
     } else {
-        alg = 4;
+        alg = 6;
     }

     return ompi_coll_tuned_barrier_intra_do_this (comm, module,
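
To make the effect of this hunk easier to see, here is a minimal standalone sketch of the barrier selection before and after the change, restricted to communicator sizes of 256 and above (the thresholds for smaller sizes are not visible in the hunk). The function and enum names are hypothetical and not part of Open MPI. The mapping 4 = bruck, 6 = tree is inferred from the commit message and the removed branches.

```c
#include <assert.h>

/* Algorithm ids as they appear in the hunk. The names are an assumption
 * inferred from the commit message (4 = bruck, 6 = tree). */
enum { BARRIER_ALG_BRUCK = 4, BARRIER_ALG_TREE = 6 };

/* Hypothetical helper: selection before the commit.
 * Only sizes >= 256 are modelled; smaller sizes are outside this sketch. */
int barrier_alg_before(int communicator_size)
{
    assert(communicator_size >= 256);
    if (communicator_size < 512) {
        return BARRIER_ALG_TREE;
    } else if (communicator_size < 1024) {
        return BARRIER_ALG_BRUCK;   /* the 512..1023 range the message calls out */
    } else if (communicator_size < 4096) {
        return BARRIER_ALG_TREE;
    }
    return BARRIER_ALG_BRUCK;       /* 4096 and above */
}

/* Hypothetical helper: selection after the commit.
 * Every size >= 256 now falls into the single else branch. */
int barrier_alg_after(int communicator_size)
{
    assert(communicator_size >= 256);
    return BARRIER_ALG_TREE;
}
```

Under these assumptions, the two functions differ only for sizes 512 to 1023 and for 4096 and above, which are the ranges where the old code fell back to bruck.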
