Add vLLM Semantic Router Blog #77
base: main
Conversation
Deploying vllm-blog-source with Cloudflare Pages

Latest commit: 87c5b92
Status: ✅ Deploy successful!
Preview URL: https://9c8df877.vllm-blog-source.pages.dev
Branch Preview URL: https://add-vsr-blog.vllm-blog-source.pages.dev
@youkaichao thank you for reviewing. Is it ready to go for publishing today? Thank you!
Removing AI-generated tags and style can make the content more inviting for humans to read.
Force-pushed from 3306f01 to 7198318
Force-pushed from 7198318 to 6b879b5
I reviewed the blog. Two comments:
Cool, thanks for the review @simon-mo! For the first one, I agree we need to emphasize the technique part. For the second one, you raised a key point that is on our roadmap: a pluggable embedding model architecture. ModernBERT is lightweight and embedded inside the router, while other embedding models can be deployed by the vLLM engine and integrated with vsr via an external call.
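For illustration, here is a minimal sketch of what such a pluggable backend could look like. All class and method names below are hypothetical, not the router's actual API; the one real interface assumed is vLLM's OpenAI-compatible `/v1/embeddings` endpoint, and the ModernBERT checkpoint name is an assumption.

```python
# Hypothetical sketch of a pluggable embedding backend for the semantic router.
# Class/method names are illustrative, not the router's real API.
from abc import ABC, abstractmethod

import requests


class EmbeddingBackend(ABC):
    """Common interface: the router would only depend on embed()."""

    @abstractmethod
    def embed(self, text: str) -> list[float]:
        ...


class LocalModernBertBackend(EmbeddingBackend):
    """Lightweight model embedded inside the router process.

    Assumes a ModernBERT-based checkpoint loadable via sentence-transformers.
    """

    def __init__(self, model_name: str = "nomic-ai/modernbert-embed-base"):
        from sentence_transformers import SentenceTransformer

        self._model = SentenceTransformer(model_name)

    def embed(self, text: str) -> list[float]:
        return self._model.encode(text).tolist()


class VllmEmbeddingBackend(EmbeddingBackend):
    """External embedding model served by a vLLM engine.

    Calls vLLM's OpenAI-compatible /v1/embeddings endpoint.
    """

    def __init__(self, base_url: str, model: str):
        self._url = f"{base_url}/v1/embeddings"
        self._model = model

    def embed(self, text: str) -> list[float]:
        resp = requests.post(self._url, json={"model": self._model, "input": text})
        resp.raise_for_status()
        return resp.json()["data"][0]["embedding"]
```

The router could then be configured with either backend without changing its routing logic, which is the point of making the architecture pluggable.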
Thank you @simon-mo for the review!
At the moment, the semantic router uses ModernBERT for internal classification. However, we will explore more ways to get text embeddings for the semantic cache. Many of these models can be hosted by vLLM, and I believe this will make the design more extensible. We'll detail these directions and use cases in the upcoming revision!
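To make the semantic-cache direction concrete, here is a rough sketch of a similarity lookup that could sit on top of any such embedding backend. The cosine threshold and the cache shape are illustrative assumptions, not the router's implementation.

```python
# Illustrative semantic-cache lookup over an EmbeddingBackend from the
# sketch above; the 0.92 threshold is an arbitrary assumption.
import math


def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)


def cached_answer(prompt, cache, backend, threshold=0.92):
    """Return a cached response if a semantically similar prompt was seen.

    cache: list of (embedding, response) pairs from earlier requests.
    """
    query = backend.embed(prompt)
    best = max(cache, key=lambda entry: cosine(query, entry[0]), default=None)
    if best is not None and cosine(query, best[0]) >= threshold:
        return best[1]  # cache hit: skip the LLM call
    return None  # cache miss: route to the model, then store the pair
```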
Force-pushed from 44b3c2e to 87c5b92
move #76 here to enable previews