ztxz16 / exvllm
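A hybrid-inference extension plugin for vLLM. It supports multi-NUMA hybrid inference, and single-GPU inference of the Qwen3-Next model can reach 1000+ prefill.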
31 | Nov 7, 2025 | Updated 3 months ago

Alternatives and similar repositories for exvllm

Users who are interested in exvllm are comparing it to the libraries listed below.

