QwenLM / vllm
View external linksLinks

A high-throughput and memory-efficient inference and serving engine for LLMs

☆37

Alternatives and similar repositories for vllm

Users that are interested in vllm are comparing it to the libraries listed below

Sorting:

Carol-gutianle / Awesome-llm-unlearning
View on GitHub
☆13Jun 17, 2024Updated last year
QwenLM / ConsisEval
View on GitHub
☆13Jul 5, 2024Updated last year
openai / openai-mcpkit
View on GitHub
☆41Nov 3, 2025Updated 3 months ago
rkuo2000 / GenAI
View on GitHub
☆11Feb 6, 2026Updated last week
peilongchencc / My-GLM-4-Voice
View on GitHub
ubuntu 系统下 GLM-4-Voice 部署经验分享
☆18Oct 31, 2024Updated last year
anthropics / orjson
View on GitHub
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy
☆37Updated this week
anthropics / sycophancy-to-subterfuge-paper
View on GitHub
☆25Sep 5, 2024Updated last year
haidequanbu / ESC-Eval
View on GitHub
[EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“
☆26Jun 24, 2024Updated last year
Yevanchen / markitdown-dify-plugin
View on GitHub
☆28Sep 1, 2025Updated 5 months ago
openai / go-alias
View on GitHub
Service for quickly aliasing and redirecting to long URLs
☆24Apr 26, 2023Updated 2 years ago
lvyufeng / uie_mindspore
View on GitHub
☆12Mar 21, 2024Updated last year
QwenLM / Qwen-Cookbook
View on GitHub
Open-source examples and guides for building with the Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.
☆37Nov 20, 2024Updated last year
6zzhh6 / WeChat_Formatting_Tool
View on GitHub
A simple WeChat Official Account layout tool based on Dify
☆16Jun 27, 2025Updated 7 months ago
GenerativeAgents / dify-book
View on GitHub
Difyで作る生成AIアプリ完全入門
☆17May 25, 2025Updated 8 months ago
KenKaiii / b0t
View on GitHub
Workflow automation, but you just describe what you want and it happens.
☆26Nov 22, 2025Updated 2 months ago
MiniMax-AI / MiniMax-Provider-Verifier
View on GitHub
MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co…
☆23Jan 15, 2026Updated 3 weeks ago
openai / compose-richtext
View on GitHub
(OpenAI Fork) A collection of Compose libraries for advanced text formatting and alternative display types.
☆78Feb 4, 2026Updated last week
MiroMindAI / MiroTrain
View on GitHub
MiroTrain is an efficient and algorithm-first framework research agent.
☆132Aug 27, 2025Updated 5 months ago
c00cjz00 / llmservice_ip
View on GitHub
☆11Aug 29, 2025Updated 5 months ago
majinkai / dify-database-to-knowledge
View on GitHub
Write the database metadata into the dify knowledge
☆12Dec 30, 2025Updated last month
sanjay-810 / AYDIV2
View on GitHub
☆12Jan 31, 2024Updated 2 years ago
aws-samples / sample-data-analyst-bi
View on GitHub
A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…
☆25Jan 6, 2026Updated last month
OneWave-AI / claude-skills
View on GitHub
100 Production-Ready Claude Code Skills - The most comprehensive collection of AI skills for sales, business automation, content creation…
☆33Oct 22, 2025Updated 3 months ago
HugoPalomares / design-intent-for-sdd
View on GitHub
☆28Dec 4, 2025Updated 2 months ago
AI45Lab / MLLMGuard
View on GitHub
☆44Jun 19, 2025Updated 7 months ago
allenai / olmo-cookbook
View on GitHub
OLMost every training recipe you need to perform data interventions with the OLMo family of models.
☆64Feb 5, 2026Updated last week
we-chatter / wechatter
View on GitHub
wechatter: An easy Conversation AI Chatbot Framework
☆10Apr 15, 2021Updated 4 years ago
arnobt78 / RAG-AI-ChatBot--Redis-Vector-QStash-NextJS-FullStack
View on GitHub
AI-Rag-ChatBot is a complete project example with RAGChat and Next.js 14, using Upstash Vector Database, Upstash Qstash, Upstash Redis, D…
☆13Jul 10, 2025Updated 7 months ago
zhiyuan-zhang0206 / HomeworkAgent
View on GitHub
A multi-agent framework to help with your homework.
☆10Mar 1, 2025Updated 11 months ago
W-O-W / LangReact
View on GitHub
LangReact 是一个配置化的 Planning Agent 应用开发工具，通过配置、插件，能快速为你的 GPT 应用提供 Planning 功能。
☆12Apr 23, 2024Updated last year
junzis / course-regression
View on GitHub
Notebooks for CS4305TU Regression Lectures
☆11Oct 14, 2022Updated 3 years ago
daijun4you / great-navi
View on GitHub
☆10Dec 29, 2023Updated 2 years ago
Use-Tusk / drift-node-demo
View on GitHub
Tusk Drift Demo - Node.js Service
☆58Jan 20, 2026Updated 3 weeks ago
Tencent-Hunyuan / Hunyuan-4B
View on GitHub
☆17Aug 5, 2025Updated 6 months ago
alterxyz / YTelegraph
View on GitHub
Python Telegraph api.
☆15Mar 22, 2025Updated 10 months ago
h9-tec / Qwen3_chat_local
View on GitHub
☆10Apr 30, 2025Updated 9 months ago
moltbook / moltbook-frontend
View on GitHub
Official frontend web application for Moltbook - The Social Network for AI Agents. Built with Next.js 14, TypeScript, Tailwind CSS featur…
☆25Feb 1, 2026Updated last week
AIGeeksGroup / DragMesh
View on GitHub
DragMesh: Interactive 3D Generation Made Easy
☆20Dec 28, 2025Updated last month
Boxin-Byron / Stock_Investment_Agent-AI_Foundations_Capstone_Project-_2025
View on GitHub
☆28Jun 27, 2025Updated 7 months ago

QwenLM / vllmView external linksLinks

Alternatives and similar repositories for vllm

QwenLM / vllm
View external linksLinks