[Feature] Support for Mooncake integration

### Checklist

- [x] 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
- [x] 2. Please use English, otherwise it will be closed.

### Motivation

Mooncake is a distributed KVCache storage engine specifically designed for inference with large language models (LLM) based on Transfer Engine. It is a central component in the KVCache-centric distributed architecture. The goal of Mooncake is to store reusable KV caches at various locations within the inference cluster.

Integrate Mooncake as a role in the RBG-deployed SGLang inference service, providing KVCache offload capabilities for the inference service.


### Related resources

[sglang-integration](https://github.com/kvcache-ai/Mooncake/blob/main/doc/en/sglang-integration-v1.md)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature] Support for Mooncake integration #74

Checklist

Motivation

Related resources

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature] Support for Mooncake integration #74

Description

Checklist

Motivation

Related resources

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions