Skip to content

Conversation

popsiclexu
Copy link

This PR enables Moore Threads GPUDirect RDMA support for the Transfer Engine's RDMA transport. To compile and link the musa libraries, please set the USE_MUSA flag to "ON".

@alogfans
Copy link
Collaborator

alogfans commented Sep 22, 2025

Is there any mechanism similar to CUDA IPC?

Additionally, I think in the future we'd better allow the mixture use of Moore Threads GPUs and NVGPUs based on the next geenration of TE https://github.com/kvcache-ai/Mooncake/tree/dev/tent. Maybe we can figure out how to approach this goal.

@popsiclexu
Copy link
Author

Is there any mechanism similar to CUDA IPC?

Additionally, I think in the future we'd better allow the mixture use of Moore Threads GPUs and NVGPUs based on the next geenration of TE https://github.com/kvcache-ai/Mooncake/tree/dev/tent. Maybe we can figure out how to approach this goal.

Yes, MUSA has mechanism similar to CUDA IPC.

The mixture use of MTGPUS and NVGPUs based on the next geenration of TE is a great work. The usage of MUSA closely resembles that of CUDA, requiring primarily the adaptation of CUDA APIs to MUSA APIs. For reference, mooncake-transfer-engine/include/musa_porting.h is a example

@stmatengss
Copy link
Collaborator

Hi, @popsiclexu . Have you tested GDR-based PD in SGLang with Moore Threads GPU? (sry, we don't have the necessary testbed.

@popsiclexu
Copy link
Author

popsiclexu commented Sep 28, 2025

Hi, @popsiclexu . Have you tested GDR-based PD in SGLang with Moore Threads GPU? (sry, we don't have the necessary testbed.

@stmatengss Yes, I have, and it works fine.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants