A high-throughput and memory-efficient inference and serving engine for LLMs
☆42Jan 26, 2025Updated last year
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jun 17, 2024Updated 2 years ago
- ☆17Mar 22, 2024Updated 2 years ago
- ☆61May 21, 2026Updated 3 weeks ago
- ☆23Dec 16, 2025Updated 6 months ago
- ☆22Feb 13, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Mar 21, 2024Updated 2 years ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- ☆46Jun 19, 2025Updated 11 months ago
- TAP parser for .NET☆27Sep 19, 2019Updated 6 years ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- Javascript SDK for interacting with the MCP Toolbox for Databases.☆73Updated this week
- my personal mcp server☆13Apr 23, 2025Updated last year
- Official implementation of Latent-SFT: teaching LLMs to reason with vocabulary-space latent chains.☆51May 18, 2026Updated last month
- Azure Command-Line Interface☆15Mar 26, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy☆55May 5, 2026Updated last month
- A simple python library for Pexels.com. This package covers, search photos, curated photos, and get an individual photo as well as search…☆12Feb 7, 2022Updated 4 years ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Feb 5, 2025Updated last year
- Watermarking LLM papers up-to-date☆12Dec 17, 2023Updated 2 years ago
- 主题:计算认知科学(Computational Cognitive Science)。此仓库诞生背景为IA003结业BP,仍处于萌芽期,内容设置有待转正。下一次大规模更新估计在三四年之后。☆17May 22, 2019Updated 7 years ago
- A curated list of open-source projects related to MoonshotCoder.☆37May 22, 2024Updated 2 years ago
- ☆37Jul 31, 2025Updated 10 months ago
- Service for quickly aliasing and redirecting to long URLs☆26Apr 26, 2023Updated 3 years ago
- ☆15Dec 20, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)☆40Oct 8, 2025Updated 8 months ago
- 2022龙芯杯个人赛三等奖作品☆14Oct 11, 2023Updated 2 years ago
- ☆18Mar 15, 2021Updated 5 years ago
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆15Apr 14, 2025Updated last year
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated last year
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆23Feb 16, 2025Updated last year
- 本项目利用深度学习技术,实时检测人体3D姿态,并基于此预测未来人体动作。采用mmpose框架与多进程技术实现后端快速预测,利用混合现实Hololens2头戴显示器显示人物动作,做到实时抓取,实时预测,实时显示。☆12Oct 30, 2023Updated 2 years ago
- MeloTTS demo on Axera☆13Nov 18, 2025Updated 7 months ago
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆23Oct 14, 2025Updated 8 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [EMNLP 2025 Findings] Retrieval-Augmented Machine Translation with Unstructured Knowledge☆15Sep 4, 2025Updated 9 months ago
- ☆11Apr 5, 2022Updated 4 years ago
- Open Co Scientist aims to democratize scientific research by providing an open-source implementation of an AI co-scientist system.☆15Mar 1, 2025Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆73May 29, 2026Updated 2 weeks ago
- 一个简洁高效的 AI 命令行助手,支持对话、命令生成、文件处理。☆17Sep 16, 2025Updated 9 months ago
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24…☆26Jun 14, 2024Updated 2 years ago
- React app for inspecting, building and debugging with the Realtime API☆11Nov 5, 2024Updated last year