-
Notifications
You must be signed in to change notification settings - Fork 12.7k
vulkan: fuse adds #15252
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
vulkan: fuse adds #15252
Conversation
Fuse adds that have the same shape, which are common in MoE models. It will currently fuse up to 6 adds, because we assume no more than 8 descriptors per dispatch. But this could be changed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good on AMD and Nvidia, but I can't get it to run on Intel.
terminate called after throwing an instance of 'vk::DeviceLostError'
what(): vk::Device::waitForFences: ErrorDeviceLost
I'll investigate further later.
Strange. Any validation failures? Does the backend test fail, or just in real models? |
Yeah, the test fails too on Intel:
Edit: No validation failures. Probably a driver bug. |
Shall I just disable the optimization for Intel? |
Yeah, I don't see why it's failing. |
Fuse adds that have the same shape, which are common in MoE models. It will currently fuse up to 6 adds, because we assume no more than 8 descriptors per dispatch. But this could be changed.