A high-throughput and memory-efficient inference and serving engine for LLMs
☆16Mar 20, 2026Updated this week
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Vocabulary Parallelism☆25Mar 10, 2025Updated last year
- A Rails 8 LLM Starter Kit with Raix and RailsUI on Replit☆12Aug 24, 2025Updated 7 months ago
- ☆17Feb 2, 2024Updated 2 years ago
- ☆14May 10, 2024Updated last year
- GPT3 Chrome Extension Starter Kit☆16Jan 16, 2023Updated 3 years ago
- Quick Notebook Tutorials☆36Jul 17, 2025Updated 8 months ago
- Tabler Rails Starter - Give your Rails app a head start with a premium, open-source dashboard template that offers a responsive and high-…☆16Nov 21, 2025Updated 4 months ago
- Git scrapers for scraping the fediverse☆20Updated this week
- An LLM playground similar to the OpenAI API playground☆22Dec 26, 2023Updated 2 years ago
- Breaking same-domain policy one request at a time☆52Jun 16, 2011Updated 14 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆267Dec 4, 2025Updated 3 months ago
- code pattern and instructions to deploy intelligent loan web app☆11Sep 17, 2025Updated 6 months ago
- Multi-user password-store☆20Jun 5, 2024Updated last year
- ☆24Nov 17, 2016Updated 9 years ago
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".☆47Jul 12, 2024Updated last year
- A port of Open AI's Swarm library written in Ruby☆22Oct 14, 2024Updated last year
- A curated list of awesome stuff built using Amazon Alexa☆23Jun 23, 2016Updated 9 years ago
- Presentation, Code and Notebooks used in the conference☆11Aug 1, 2023Updated 2 years ago
- ☆12Dec 15, 2025Updated 3 months ago
- ☆10Oct 6, 2024Updated last year
- Satcom radio website☆15Jan 29, 2026Updated last month
- ☆63Jul 21, 2024Updated last year
- ☆10Aug 10, 2024Updated last year
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Oct 29, 2025Updated 4 months ago
- Travel planning Streamlit web-app based on OpenAI API☆26Sep 6, 2025Updated 6 months ago
- An enterprise-grade AI-powered backtesting framework built on the Swarms framework for automated trading strategy validation and optimiza…☆11Oct 13, 2025Updated 5 months ago
- Automate indexing files and pages from SharePoint into Azure AI Search☆11Jun 6, 2024Updated last year
- A jQuery plugin for text that grows on you☆45Oct 20, 2015Updated 10 years ago
- An LLM-based Multi-Agent Framework for Financial Crime & Suspicious Matter Reporting☆13Apr 28, 2024Updated last year
- OSINT tool for researching targets based on Email Address or Username using SerperDev, Firecrawl, HIBP & OSINT Industries APIs, and OpenA…☆16Sep 17, 2024Updated last year
- Run AuraFlow on Replicate☆14Jul 12, 2024Updated last year
- The SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by a LLM to guide the agent's …☆21Sep 18, 2025Updated 6 months ago
- A demo and tutorial for Council that implements a financial analyst agent.☆11Jun 21, 2024Updated last year
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus o…☆17Mar 31, 2025Updated 11 months ago
- Airline AI Agent with Langflow and DataStax Astra☆12Jan 14, 2025Updated last year
- ☆11Feb 13, 2024Updated 2 years ago
- ☆15Jul 5, 2024Updated last year
- This is a controller for the WE-R2.4 robot on Thingiverse.☆13Aug 19, 2019Updated 6 years ago
- This project showcases a comprehensive authentication system using Supabase as the backend, implemented with both Streamlit and FastAPI f…☆17Aug 30, 2024Updated last year