vllama is an open source hybrid server that combines Ollama's seamless model management with vLLM's lightning-fast GPU inference, delivering a drop-in OpenAI-compatible API for optimized performance.
☆73Nov 21, 2025Updated 5 months ago
Alternatives and similar repositories for vllama
Users that are interested in vllama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Distribute and run transformer encoders with a single file.☆103Updated this week
- Official repo for "REX-RAG: Reasoning Exploration with Policy Correction in Retrieval-Augmented Generation"☆34Sep 28, 2025Updated 7 months ago
- The PyTorch Library for LLM Applications.☆16Jul 16, 2024Updated last year
- ☆16Jan 4, 2025Updated last year
- ☆16Feb 7, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A ComfyUI image generation integration for oobabooga's Text Generation WebUI☆15Aug 12, 2025Updated 8 months ago
- A simple Electron app to wrap around MPV to play VRCDN streams without any buffer.☆10Nov 1, 2023Updated 2 years ago
- Sample app that prints the compute region it’s running on☆18Apr 24, 2024Updated 2 years ago
- ☆10Feb 14, 2021Updated 5 years ago
- Copilot with deepseek and more...☆13Mar 7, 2025Updated last year
- A Subgraph for Lens Protocol☆11Oct 17, 2022Updated 3 years ago
- Advanced Terraform with GCP, Wes Coffay, January 2021☆11Mar 1, 2022Updated 4 years ago
- ☆15Apr 15, 2025Updated last year
- UdonSharp toggle script for standard buttons with useful options.☆13Jul 31, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Apr 14, 2026Updated 3 weeks ago
- Simple voting app demo☆13Dec 3, 2021Updated 4 years ago
- A collection python tools used to create gguf files and upload to huggingface☆17Mar 28, 2026Updated last month
- ☆27Aug 7, 2025Updated 9 months ago
- 3rd-person camera prefabs for VRChat world creation☆14Apr 21, 2023Updated 3 years ago
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆65Apr 28, 2026Updated last week
- ☆52Mar 12, 2026Updated last month
- A guide for configuring TouchDesigner and OBS for VJing in VRChat☆12Oct 1, 2024Updated last year
- An example terraform + terragrunt repository. Features Google Kubernetes Engine, Google Cloud SQL, Google Cloud Proxy, NGINX.