vllama is an open source hybrid server that combines Ollama's seamless model management with vLLM's lightning-fast GPU inference, delivering a drop-in OpenAI-compatible API for optimized performance.
☆70 · Nov 21, 2025 · Updated 4 months ago
Alternatives and similar repositories for vllama
Users that are interested in vllama are comparing it to the libraries listed below.
- Code Mode inspired local sandboxed MCP Gateway - collapses N servers x M tools into 2 tools (~1,000 tokens) ☆141 · Mar 22, 2026 · Updated 3 weeks ago
- Distribute and run transformer encoders with a single file. ☆89 · Apr 10, 2026 · Updated last week
- Official repo for "REX-RAG: Reasoning Exploration with Policy Correction in Retrieval-Augmented Generation" ☆33 · Sep 28, 2025 · Updated 6 months ago
- Generate Structured JSON with probs from Language Models ☆17 · Mar 23, 2025 · Updated last year
- A ComfyUI image generation integration for oobabooga's Text Generation WebUI ☆15 · Aug 12, 2025 · Updated 8 months ago
- Design patterns and production-ready architectures for building multi-agent AI systems with Google ADK. ☆38 · Mar 13, 2026 · Updated last month
- A simple Electron app to wrap around MPV to play VRCDN streams without any buffer. ☆10 · Nov 1, 2023 · Updated 2 years ago
- Sample app that prints the compute region it’s running on ☆18 · Apr 24, 2024 · Updated last year
- ☆12 · Apr 20, 2025 · Updated 11 months ago
- ☆10 · Feb 14, 2021 · Updated 5 years ago
- Python AIMP remote API wrapper with some extras ☆12 · Dec 18, 2023 · Updated 2 years ago
- Copilot with deepseek and more... ☆13 · Mar 7, 2025 · Updated last year
- ☆15 · Apr 15, 2025 · Updated last year
- Turning messy repos into weapons of mass structured context. ☆22 · Feb 20, 2026 · Updated last month
- ☆13 · Updated this week
- A collection of Python tools used to create GGUF files and upload them to Hugging Face ☆17 · Mar 28, 2026 · Updated 3 weeks ago
- An application that helps you summarize your meetings in real time using OpenAI's ChatGPT APIs. ☆12 · Mar 14, 2023 · Updated 3 years ago
- OntoLearner: A Modular Python Library for Ontology Learning with LLMs https://pypi.org/project/OntoLearner/ ☆32 · Apr 8, 2026 · Updated last week
- ✨ A high-performance code agent written in Rust, combining the best features of WCGW for maximum efficiency and semantic capabilities. 🦀 ☆26 · Apr 13, 2026 · Updated last week
- ☆10 · Jan 30, 2023 · Updated 3 years ago
- Detect how uv was installed and get upgrade instructions ☆32 · Jul 23, 2025 · Updated 8 months ago
- A go-gettable decoder/converter for HEIC/HEIF/AVIF based on libheif ☆13 · Aug 7, 2024 · Updated last year
- Modern, AI-native and agentic Pythonic data transformation platform. ☆58 · Apr 13, 2026 · Updated last week
- ☆15 · Nov 28, 2025 · Updated 4 months ago
- Yet another web framework in Go. ☆10 · Apr 11, 2026 · Updated last week
- ☆12 · Nov 4, 2023 · Updated 2 years ago
- Your AI-Powered Test Agent for Comprehensive Test Cases Generation and Effective Test Automation leveraging RAG architecture & GenAI ☆19 · Jul 23, 2025 · Updated 8 months ago
- A blazingly fast, lightweight WebView framework for DCC (Digital Content Creation) software, built with Rust and Python bindings. ☆31 · Updated this week
- VRChat Billiards - Updated and maintained. ☆17 · Jan 29, 2021 · Updated 5 years ago
- Manage agents in Agentspace Agent Gallery and Agent Engine ☆29 · Aug 22, 2025 · Updated 7 months ago
- Use the ComfyUI API in Unity ☆24 · Jan 16, 2024 · Updated 2 years ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection ☆31 · Feb 10, 2026 · Updated 2 months ago
- Server plugin to generate TTS voices using MsEdgeTTS. ☆24 · Apr 6, 2026 · Updated last week
- Python SDK for Modaic ☆23 · Updated this week
- ☆10 · Apr 12, 2026 · Updated last week
- Getting Started with Ansible (refresh/2nd edition) ☆14 · Apr 28, 2023 · Updated 2 years ago
- MMD Sphere and Toon Texture supporting shaders based on Flat Lit Toon ☆16 · Sep 21, 2018 · Updated 7 years ago
- Easy integration of heterogeneous iterable data within Semantic Knowledge Graphs databases. ☆19 · Apr 1, 2026 · Updated 2 weeks ago
- ☆17 · Jun 7, 2022 · Updated 3 years ago