A high-throughput and memory-efficient inference and serving engine for LLMs
☆35Mar 21, 2024Updated 2 years ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,405Nov 29, 2024Updated last year
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆194Jun 13, 2024Updated last year
- Hypercorn is an ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.☆14Jan 12, 2026Updated 2 months ago
- An open-source AI agent that lives in your terminal.☆33Updated this week
- Math library for JavaScript 2D/3D graphics rendering.☆11Aug 30, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Jan 10, 2024Updated 2 years ago
- A Mac OS X application for recording the screen and converting to .webm (for now) -- written in Swift☆10Dec 19, 2014Updated 11 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- ☆14Jul 5, 2024Updated last year
- Artifacts for the "SurgeProtector: Mitigating Temporal Algorithmic Complexity Attacks using Adversarial Scheduling" paper that appears in…☆12Jun 24, 2022Updated 3 years ago
- llmon-py is a multimodal webui for Llama 3-8B.☆16Jul 1, 2024Updated last year
- ☆137May 15, 2024Updated last year
- [ICLR 2024] Thin-shell Object Manipulations with Differentiable Physics Simulations☆53Jun 5, 2024Updated last year
- Online Chat App using React JS☆10Mar 4, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- RabbitMQ on Render☆15Feb 18, 2026Updated last month
- 🍏专门为 2024 书生·浦语大模型挑战赛 (春季赛) 准备的 Repo🍎收录了赫萝相关的微调源码☆12Sep 20, 2024Updated last year
- This very simple python script takes inputs from your business and outputs articles written bhy claude.☆13Apr 3, 2024Updated last year
- ☆94Mar 4, 2024Updated 2 years ago
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆13Dec 5, 2023Updated 2 years ago
- ☆35Mar 18, 2026Updated last week
- Template CrewAI allowing for selection of multiple agents including GPT-3, GPT-4, Mixtral, Llama 3, and Gemma☆11May 11, 2024Updated last year
- ☆12Nov 11, 2025Updated 4 months ago
- URL router for ReactPy☆15Feb 14, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆51Updated this week
- The official Kamen Rider Craft The 4th Git!!☆22Sep 23, 2024Updated last year
- The Shopify Automation Toolkit☆15Apr 21, 2024Updated last year
- Llama 3 ORPO Fine Tuning on A100 in Colab Pro.☆12Apr 21, 2024Updated last year
- ☆24Nov 19, 2024Updated last year
- ☆17Apr 11, 2024Updated last year
- AI-Rag-ChatBot is a complete project example with RAGChat and Next.js 14, using Upstash Vector Database, Upstash Qstash, Upstash Redis, D…☆15Jul 10, 2025Updated 8 months ago
- ☆21Oct 4, 2025Updated 5 months ago
- ICLR Reproducibility Challenge: Generative Adversarial Models For Learning Private And Fair Representations☆12Jan 12, 2019Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆12Jul 24, 2018Updated 7 years ago
- MCP Server to make searching openrouter easy☆19Feb 28, 2026Updated 3 weeks ago
- Notes and to-dos organizer☆19Updated this week
- ☆20Mar 4, 2025Updated last year
- 利用 CVE-2024-0044 Android 权限提升下载任意目标App沙箱文件。☆14Sep 3, 2024Updated last year
- ☆14Aug 9, 2024Updated last year
- A high-performance tokenizer (BPE + SentencePiece) built with Rust with Python bindings, focused on speed, safety, and resource optimizat…☆57Mar 15, 2026Updated last week