[Feature]: Implement distributed Flash Decode + Latency Tests #73

@octatrifan-amd

Suggestion Description

Implement distributed Flash Decode, similar to the implementation in triton-distributed but with fused kernels built on Iris, and add latency tests for it.
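For reference, here is a minimal pure-Python sketch of the math a distributed Flash Decode has to reproduce. It is not Iris or triton-distributed code. It assumes the KV cache is split evenly across ranks, that each rank computes attention over its own shard together with a log-sum-exp (LSE), and that the partial results are then merged. The function names (`partial_attention`, `combine_partials`, `sharded_decode`) are hypothetical and only illustrate the idea; a real implementation would fuse these steps into GPU kernels and exchange the partials across devices.

```python
import math


def partial_attention(q, keys, values):
    """Single-query attention over one KV shard.

    Returns the shard-local normalized output and the log-sum-exp of the scores.
    """
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
    m = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - m) for s in scores]
    denom = sum(weights)
    dim = len(values[0])
    out = [sum(w * v[d] for w, v in zip(weights, values)) / denom for d in range(dim)]
    return out, m + math.log(denom)


def combine_partials(partials):
    """Merge per-shard (output, lse) pairs into the full attention output."""
    lse_max = max(lse for _, lse in partials)
    coeffs = [math.exp(lse - lse_max) for _, lse in partials]
    total = sum(coeffs)
    dim = len(partials[0][0])
    return [
        sum(c * out[d] for c, (out, _) in zip(coeffs, partials)) / total
        for d in range(dim)
    ]


def sharded_decode(q, keys, values, num_ranks):
    """Simulate num_ranks ranks, each owning a contiguous slice of the KV cache."""
    shard = math.ceil(len(keys) / num_ranks)
    partials = []
    for rank in range(num_ranks):
        ks = keys[rank * shard:(rank + 1) * shard]
        vs = values[rank * shard:(rank + 1) * shard]
        if ks:  # a rank may own no tokens when the cache is short
            partials.append(partial_attention(q, ks, vs))
    return combine_partials(partials)
```

Because each shard's output is reweighted by `exp(lse)`, the merged result should match attention computed over the unsplit KV cache up to floating-point error, which gives a simple correctness check to run alongside the latency tests.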

Operating System

No response

GPU

No response

ROCm Component

No response

Metadata

Labels

enhancement: New feature or request
examples: Examples showcasing Iris APIs and usage

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests