Skip to content

Conversation

@abhilash1910
Copy link

@Godofnothing Thanks for creating this repository and supporting faster gemms.
I am currently working on AutoGPTQ extension for SYCL runtime (AutoGPTQ/AutoGPTQ#638) . Since the build of the asm instructions for Marlin are from here, I propose to have an analogous SYCL counterpart in this repository.
I believe this addition would help us (Intel and SYCL in general) to actively benchmark against ptx ISA and check for performance gaps . This would also open avenues on non Intel hardware to use the SYCL runtime. [Creating a draft PR now]
Also tagging @fxmarty (autoGPTQ) for info. Thanks

@abhilash1910 abhilash1910 marked this pull request as draft August 15, 2024 13:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant