-
Notifications
You must be signed in to change notification settings - Fork 42
Pull requests: meta-pytorch/tritonbench
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Grouped Gemm][CuTeDSL] Force unrolling of certain loops
cla signed
fb-exported
meta-exported
#497
opened Sep 30, 2025 by
NikhilAPatel
Loading…
further additions of dynamic benchmarking
cla signed
fb-exported
meta-exported
#496
opened Sep 30, 2025 by
adamomainz
Loading…
Add TLX FA fwd kernel
cla signed
fb-exported
meta-exported
#495
opened Sep 30, 2025 by
htyu
Loading…
Add backward compatibility for TensorDescriptor
cla signed
#457
opened Sep 19, 2025 by
bdbowyer
Loading…
Optimized Triton RMSNorm Backwards
cla signed
fb-exported
meta-exported
#456
opened Sep 19, 2025 by
PaulZhang12
Loading…
Add a Blackwell-specific scaled persistent + TMA template for GEMMs
cla signed
fb-exported
meta-exported
#432
opened Sep 17, 2025 by
jananisriram
Loading…
adding arguments to add_benchmark to match registry
cla signed
fb-exported
#381
opened Sep 2, 2025 by
adamomainz
Loading…
Add cutlass decode kernel to TritonBench
cla signed
fb-exported
meta-exported
#376
opened Aug 28, 2025 by
Aya-ZIbra
Loading…
Validate exhaustive autotuning for FP8 Inductor templates
cla signed
fb-exported
#355
opened Aug 25, 2025 by
jananisriram
Loading…
[DO NOT LAND] Try always enabling cuda graph
cla signed
#348
opened Aug 21, 2025 by
xuzhao9
Loading…
Fixing naming convention of gemms
cla signed
fb-exported
#342
opened Aug 20, 2025 by
adamomainz
Loading…
Add amax as default per-row scaling factor for fp8_gemm benchmark
cla signed
fb-exported
#341
opened Aug 20, 2025 by
jananisriram
Loading…
Move scaling logic to input generation
cla signed
fb-exported
#338
opened Aug 20, 2025 by
jananisriram
Loading…
Add benchmarking on shapes from CSV files to fp8_gemm
cla signed
fb-exported
#332
opened Aug 18, 2025 by
jananisriram
Loading…
Fix input to TritonSplitK performance benc
cla signed
fb-exported
#323
opened Aug 1, 2025 by
Aya-ZIbra
Loading…
Add TLX attention (WS pipelined pingpong hopper)
cla signed
#320
opened Jul 31, 2025 by
yf225
Loading…
Allow TMA benchmarks for flex-attention kernel
cla signed
fb-exported
#225
opened May 15, 2025 by
mandroid6
Loading…
ProTip!
Adding no:label will show everything without a label.