A high-throughput and memory-efficient inference and serving engine for LLMs
☆17Apr 10, 2026Updated this week
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ⚡ Guidance, samples, and tools for HPC workloads on AKS clusters with RDMA and InfiniBand support, including GPUDirect RDMA.☆22Updated this week
- ☆15Aug 12, 2019Updated 6 years ago
- Create sales demos on k8s/OpenShift with Ansible☆16Aug 8, 2024Updated last year
- Vocabulary Parallelism☆25Mar 10, 2025Updated last year
- A Rails 8 LLM Starter Kit with Raix and RailsUI on Replit☆12Aug 24, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Use S3 object versioning for file versioning☆19Apr 5, 2026Updated last week
- ☆17Feb 2, 2024Updated 2 years ago
- Sequence-level 1F1B schedule for LLMs.☆38Aug 26, 2025Updated 7 months ago
- Ansible Playbooks to deploy a set of OpenShift Cluster onto RHEV☆17Updated this week
- Support Folding at Home by running the FAHClient on your Kubernetes cluster☆19Mar 15, 2020Updated 6 years ago
- Quick Notebook Tutorials☆36Jul 17, 2025Updated 8 months ago
- A simple and spectacular photo-tweeting birdhouse☆83Apr 30, 2014Updated 11 years ago
- A collection of free of charge learning resources from Red Hat☆32Aug 4, 2022Updated 3 years ago
- Guide to demo NetApp Trident using Lab On Demand☆16Oct 24, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Git scrapers for scraping the fediverse☆20Updated this week
- An LLM playground similar to the OpenAI API playground☆23Dec 26, 2023Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆266Dec 4, 2025Updated 4 months ago
- AI Assistant (aia) a Ruby Gem for using genAI on the CLI☆30Updated this week
- This project provides an automated installation and deployment of Grafana, NetApp E-Series Web Services, and supporting software for perf…☆23Dec 8, 2022Updated 3 years ago
- ☆13Jun 15, 2023Updated 2 years ago
- rUv-Engineer - let's you describe UI using your imagination, then see it rendered live.☆12Sep 28, 2024Updated last year
- ☆32Feb 27, 2023Updated 3 years ago
- Presentation, Code and Notebooks used in the conference☆11Aug 1, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆12Dec 15, 2025Updated 3 months ago
- ☆10Oct 6, 2024Updated last year
- The AI Accelerator is a template project for setting up Red Hat OpenShift AI using GitOps☆66Updated this week
- Satcom radio website☆16Jan 29, 2026Updated 2 months ago
- ☆10Aug 10, 2024Updated last year
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Oct 29, 2025Updated 5 months ago
- llm-d helm charts and deployment examples☆54Apr 2, 2026Updated last week
- Quickstart for deploying WordPress to OpenShift 3.☆34Nov 19, 2020Updated 5 years ago
- a simple casual graph evaluator (for experiments)☆13Jan 3, 2019Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Travel planning Streamlit web-app based on OpenAI API☆26Sep 6, 2025Updated 7 months ago
- An enterprise-grade AI-powered backtesting framework built on the Swarms framework for automated trading strategy validation and optimiza…☆13Oct 13, 2025Updated 6 months ago
- 3scale toolbox☆38Nov 18, 2025Updated 4 months ago
- An LLM-based Multi-Agent Framework for Financial Crime & Suspicious Matter Reporting☆13Apr 28, 2024Updated last year
- Automate indexing files and pages from SharePoint into Azure AI Search☆11Jun 6, 2024Updated last year
- Run AuraFlow on Replicate☆14Jul 12, 2024Updated last year
- A demo and tutorial for Council that implements a financial analyst agent.☆11Jun 21, 2024Updated last year