FP8 GEMM with PyTorch Interface

Usage

Insall the kernels using the following commands:

git clone https://github.com/IST-DASLab/gemm_fp8.git
cd gemm_fp8
pip install -e .  # or pip install .

Then, the kernel can be used as follows:

import torch
import gemm_fp8
y = gemm_fp8.matmul(a, b, alpha=1.0)

where a and b are the input matrices (in torch.float8_e4m3fn format) and alpha is the scaling factor (in float).

Run the following command to benchmark the kernel:

python benchmark.py

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
cutlass @ 902dff3		cutlass @ 902dff3
gemm_fp8		gemm_fp8
.gitignore		.gitignore
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
benchmark.py		benchmark.py
pyproject.toml		pyproject.toml
setup.py		setup.py