A high-throughput and memory-efficient inference and serving engine for LLMs
☆37Jan 26, 2025Updated last year
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- ☆16May 16, 2025Updated 9 months ago
- ☆13Jun 17, 2024Updated last year
- ☆15Mar 22, 2024Updated last year
- ☆13Jul 5, 2024Updated last year
- ubuntu 系统下 GLM-4-Voice 部署经验分享☆18Oct 31, 2024Updated last year
- ☆44Nov 3, 2025Updated 4 months ago
- ☆11Feb 25, 2026Updated last week
- Asterisk Model Context Protocol (MCP) server.☆31Mar 24, 2025Updated 11 months ago
- Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy☆40Feb 8, 2026Updated 3 weeks ago
- ☆25Sep 5, 2024Updated last year
- [EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“☆26Jun 24, 2024Updated last year
- ☆28Sep 1, 2025Updated 6 months ago
- Service for quickly aliasing and redirecting to long URLs☆25Apr 26, 2023Updated 2 years ago
- ☆12Mar 21, 2024Updated last year
- Open-source examples and guides for building with the Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.☆37Nov 20, 2024Updated last year
- A curated list of open-source projects related to MoonshotCoder.☆35May 22, 2024Updated last year
- Difyで作る生成AIアプリ完全入門☆17May 25, 2025Updated 9 months ago
- A simple WeChat Official Account layout tool based on Dify☆17Jun 27, 2025Updated 8 months ago
- ☆23Updated this week
- (OpenAI Fork) A collection of Compose libraries for advanced text formatting and alternative display types.☆80Feb 12, 2026Updated 3 weeks ago
- Workflow automation, but you just describe what you want and it happens.☆27Nov 22, 2025Updated 3 months ago
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co…☆29Feb 18, 2026Updated 2 weeks ago
- ☆28Dec 4, 2025Updated 3 months ago
- Data Plane Development Kit☆11Jan 27, 2026Updated last month
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆28Feb 13, 2026Updated 3 weeks ago
- [ICRA 2026] StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes☆20Feb 17, 2026Updated 2 weeks ago
- 参考《上海交通大学生存手册》开源☆16Sep 25, 2024Updated last year
- In-depth documentation for Gnoland developers, providing introductions, deep tutorials, and development resources.☆12Mar 12, 2024Updated last year
- Write the database metadata into the dify knowledge☆12Dec 30, 2025Updated 2 months ago
- my personal mcp server☆13Apr 23, 2025Updated 10 months ago
- ☆11Aug 29, 2025Updated 6 months ago
- ☆44Jun 19, 2025Updated 8 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆64Feb 27, 2026Updated last week
- Device tree for the Samsung Galaxy S10 (SM-G973F)☆11Jul 23, 2021Updated 4 years ago
- ☆17Aug 5, 2025Updated 7 months ago
- A small framework to benchmark forecasting models via backtesting☆13Nov 25, 2023Updated 2 years ago
- Community maintained hardware plugin for vLLM on AWS Neuron☆23Feb 26, 2026Updated last week
- LangReact 是一个配置化的 Planning Agent 应用开发工具,通过配置、插件,能快速为你的 GPT 应用提供 Planning 功能。☆12Apr 23, 2024Updated last year
- A universal skills runtime framework SDK for building, deploying, and executing modular capabilities across diverse environments.☆27Updated this week