Skip to main content

🏆 Papers & Talks

Innovation thrives when great minds come together ❤️

🏆 Research Publications

🏆
RESEARCH PUBLICATION

When to Reason: Semantic Router for vLLM

Authors:Chen Wang, Xunzhuo Liu, Yuhan Liu, Yue Zhu, Xiangxi Mo, Junchen Jiang, Huamin Chen
Venue:NeurIPS - MLForSys2025
We present a semantic router that classifies queries based on their reasoning requirements and selectively applies reasoning only when beneficial.
vLLMSemantic Router
🏆
RESEARCH PUBLICATION

Semantic Inference Routing Protocol (SIRP)

Authors:Huamin Chen, Luay Jalil
Venue:Internet Engineering Task Force (IETF)2025
This document specifies the Semantic Inference Routing Protocol (SIRP), a framework for content-level classification and semantic routing in AI inference systems.
vLLMSemantic Router

🏆 Conference Presentations

🤗
CONFERENCE PRESENTATION

Intelligent LLM Routing: A New Paradigm for Multi-Model AI Orchestration in Kubernetes

Speakers:Chen Wang, Huamin Chen
Venue:KubeCon NA 20252025
This research-driven talk introduces a novel architecture paradigm that complements recent advances in timely intelligent inference routing for large language models.
vLLMSemantic Router
🤗
CONFERENCE PRESENTATION

vLLM Semantic Router: Unlock the Power of Intelligent Routing

Speakers:Xunzhuo Liu
Venue:vLLM Meetup Beijing2025
A deep dive into vLLM Semantic Router capabilities, demonstrating how intelligent routing can unlock new possibilities for efficient LLM inference.
vLLMSemantic Router
🤗
CONFERENCE PRESENTATION

AI-Powered vLLM Semantic Router

Speakers:Huamin Chen
Venue:vLLM Office Hours2025
An overview of AI-powered features in vLLM Semantic Router, showcasing the latest developments and community contributions.
vLLMSemantic Router