fix(ggml-sycl): add synchronization before exiting argsort kernel #15582

MengAiDev · 2025-08-26T04:21:51Z

Add stream->wait() to ensure all kernels finish execution before proceeding
This resolves potential race conditions in the argsort operation

- Add `stream->wait()` to ensure all kernels finish execution before proceeding - This resolves potential race conditions in the argsort operation

simonlui · 2025-08-26T05:31:32Z

@MengAiDev The closing brace for the function is missing so it fails to compile when I tried to check out the branch. I added an extra line to close it with } and it works.

NeoZhangJianyu · 2025-08-27T01:59:03Z

#15580 support on iGPU.
Could you check if dGPU has this issue?
if no, maybe add the condition to check the iGPU and add wait() for iGPU only.

It could reduce the protentional risk to dGPU.

simonlui · 2025-08-27T02:06:57Z

@NeoZhangJianyu I have an Intel Arc A770 16GB and can confirm the issue existed on my dGPU too. This is a snippet from the backtrace I posted in the issue.
/home/simonlui/Code_Repositories/llama-cpp-python/vendor/llama.cpp/ggml/src/ggml-sycl/ggml-sycl.cpp:3380: GGML_ASSERT(row_id_i >= 0 && row_id_i < n_as) failed
Same assert error as iGPU.

NeoZhangJianyu · 2025-08-27T03:06:32Z

@NeoZhangJianyu I have an Intel Arc A770 16GB and can confirm the issue existed on my dGPU too. This is a snippet from the backtrace I posted in the issue. /home/simonlui/Code_Repositories/llama-cpp-python/vendor/llama.cpp/ggml/src/ggml-sycl/ggml-sycl.cpp:3380: GGML_ASSERT(row_id_i >= 0 && row_id_i < n_as) failed Same assert error as iGPU.

OK! Thank you for your feedback!
It's OK to me!

MengAiDev · 2025-08-28T00:28:56Z

I have fix the }

fix(ggml-sycl): add synchronization before exiting argsort kernel

ce79ded

- Add `stream->wait()` to ensure all kernels finish execution before proceeding - This resolves potential race conditions in the argsort operation

github-actions bot added ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels Aug 26, 2025

NeoZhangJianyu approved these changes Aug 27, 2025

View reviewed changes

fix

afb6f45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(ggml-sycl): add synchronization before exiting argsort kernel #15582

fix(ggml-sycl): add synchronization before exiting argsort kernel #15582

MengAiDev commented Aug 26, 2025 •

edited

Loading

Uh oh!

simonlui commented Aug 26, 2025

Uh oh!

NeoZhangJianyu commented Aug 27, 2025

Uh oh!

simonlui commented Aug 27, 2025

Uh oh!

NeoZhangJianyu commented Aug 27, 2025

Uh oh!

MengAiDev commented Aug 28, 2025

Uh oh!

Uh oh!

fix(ggml-sycl): add synchronization before exiting argsort kernel #15582

Are you sure you want to change the base?

fix(ggml-sycl): add synchronization before exiting argsort kernel #15582

Conversation

MengAiDev commented Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

simonlui commented Aug 26, 2025

Uh oh!

NeoZhangJianyu commented Aug 27, 2025

Uh oh!

simonlui commented Aug 27, 2025

Uh oh!

NeoZhangJianyu commented Aug 27, 2025

Uh oh!

MengAiDev commented Aug 28, 2025

Uh oh!

Uh oh!

MengAiDev commented Aug 26, 2025 •

edited

Loading