We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent d1d106e commit dec6badCopy full SHA for dec6bad
benchmarks/run_config/fp8_gemm_rowwise_fwd.yaml
@@ -1,3 +1,3 @@
1
fp8_gemm_rowwise_fwd:
2
op: fp8_gemm_rowwise
3
- args: --op fp8_gemm_rowwise --only _triton --no_fp8_fast_accum --use_tma --use_persistent --metrics tflops --num-inputs 1 --input-id 7
+ args: --op fp8_gemm_rowwise --only _triton --no_fp8_fast_accum --use_persistent --no_use_tma --metrics tflops --num-inputs 1 --input-id 7
0 commit comments