A high-throughput and memory-efficient inference and serving engine for LLMs
☆17 · May 21, 2026 · Updated this week
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below.
- ⚡ Guidance, samples, and tools for HPC workloads on AKS clusters with RDMA and InfiniBand support, including GPUDirect RDMA. ☆23 · Updated this week
- ☆14 · Aug 12, 2019 · Updated 6 years ago
- Vocabulary Parallelism ☆26 · Mar 10, 2025 · Updated last year
- ☆17 · Feb 2, 2024 · Updated 2 years ago
- Evaluate how vLLM and SGLang perform when running a small LLM model on a mid-range NVIDIA GPU ☆21 · May 10, 2026 · Updated last week
- ☆14 · May 10, 2024 · Updated 2 years ago
- Quick Notebook Tutorials ☆36 · Jul 17, 2025 · Updated 10 months ago
- A collection of useful working GPTs ☆12 · Nov 19, 2023 · Updated 2 years ago
- Create app stacks loaded with all your favourite clients, services and infra along with code boilerplates in under 5 mins. ☆13 · Jan 19, 2023 · Updated 3 years ago
- Tabler Rails Starter - Give your Rails app a head start with a premium, open-source dashboard template that offers a responsive and high-… ☆16 · Nov 21, 2025 · Updated 6 months ago
- Reads an Atom feed and posts its entries to Instagram (basically feed2toot, but for Instagram) ☆22 · Dec 24, 2024 · Updated last year
- Pack a Ruby application into an executable jar file ☆12 · Apr 8, 2026 · Updated last month
- Git scrapers for scraping the fediverse ☆21 · Updated this week
- An LLM playground similar to the OpenAI API playground ☆24 · Dec 26, 2023 · Updated 2 years ago
- AI Assistant (aia), a Ruby Gem for using genAI on the CLI ☆30 · May 3, 2026 · Updated 3 weeks ago
- A simple Python package to stretch audio files and change their speed ☆12 · Feb 18, 2026 · Updated 3 months ago
- Code pattern and instructions to deploy intelligent loan web app ☆11 · Sep 17, 2025 · Updated 8 months ago
- ☆24 · Nov 17, 2016 · Updated 9 years ago
- An LLM inference engine, written in C++ ☆19 · Mar 30, 2026 · Updated last month
- rUv-Engineer - lets you describe UI using your imagination, then see it rendered live. ☆13 · Sep 28, 2024 · Updated last year
- A dynamic GPU memory allocator, suitable for warp synchronized scenarios. ☆11 · Aug 20, 2019 · Updated 6 years ago
- A port of OpenAI's Swarm library written in Ruby ☆22 · Oct 14, 2024 · Updated last year
- A ROS node that uses face_net to do face_recognition ☆12 · Jul 27, 2016 · Updated 9 years ago
- ☆13 · Dec 15, 2025 · Updated 5 months ago
- Presentation, Code and Notebooks used in the conference ☆11 · Aug 1, 2023 · Updated 2 years ago
- A curated list of awesome stuff built using Amazon Alexa ☆23 · Jun 23, 2016 · Updated 9 years ago
- The AI Accelerator is a template project for setting up Red Hat OpenShift AI using GitOps ☆67 · May 15, 2026 · Updated last week
- ☆10 · Aug 10, 2024 · Updated last year
- This book is the Chinese translation of "5G Mobile Networks: A Systems Approach" (https://5g.systemsapproach.org/). ☆13 · Jun 26, 2022 · Updated 3 years ago
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc… ☆10 · May 11, 2026 · Updated last week
- a simple casual graph evaluator (for experiments) ☆13 · Jan 3, 2019 · Updated 7 years ago
- ☆18 · Updated this week
- An enterprise-grade AI-powered backtesting framework built on the Swarms framework for automated trading strategy validation and optimiza… ☆13 · Oct 13, 2025 · Updated 7 months ago
- Python library for working with SEC Edgar ☆11 · Apr 4, 2024 · Updated 2 years ago
- An LLM-based Multi-Agent Framework for Financial Crime & Suspicious Matter Reporting ☆13 · Apr 28, 2024 · Updated 2 years ago
- Run AuraFlow on Replicate ☆14 · Jul 12, 2024 · Updated last year
- AOP Alliance Source Code ☆13 · Jun 14, 2013 · Updated 12 years ago
- Educational tools for AI Ethics and Safety research 🛠️🔬 ☆27 · Jan 7, 2025 · Updated last year
- An OpenAI-compatible API that integrates LLM, Embedding and Reranker. ☆18 · Aug 21, 2025 · Updated 9 months ago