vllama is an open source hybrid server that combines Ollama's seamless model management with vLLM's lightning-fast GPU inference, delivering a drop-in OpenAI-compatible API for optimized performance.
☆65 · Nov 21, 2025 · Updated 3 months ago
Alternatives and similar repositories for vllama
Users that are interested in vllama are comparing it to the libraries listed below
- Official repo for "REX-RAG: Reasoning Exploration with Policy Correction in Retrieval-Augmented Generation" ☆32 · Sep 28, 2025 · Updated 5 months ago
- A template to run a LangChain-powered app with a Chainlit front-end UI ☆13 · Aug 1, 2023 · Updated 2 years ago
- ☆12 · Apr 20, 2025 · Updated 10 months ago
- Using Random Forest algorithm to detect automated accounts on Twitter and Instagram ☆11 · Jun 21, 2024 · Updated last year
- DSPYT Website - Fostering Innovation and Education in Web3 Technologies ☆12 · Feb 15, 2026 · Updated 2 weeks ago
- A cross-platform named-pipe implementation. ☆10 · Aug 18, 2013 · Updated 12 years ago
- 2D-Doc parser library ☆15 · Jan 11, 2026 · Updated last month
- A collection of Python tools used to create GGUF files and upload them to Hugging Face ☆17 · Updated this week
- Three-Speed Memory for OpenClaw Agents: QMD working memory + Zvec hybrid search + auto-compaction ☆26 · Updated this week
- MCP server for GNU Radio ☆31 · Jan 5, 2026 · Updated 2 months ago
- A Custom Connection pooler for the SurrealDB Python SDK. ☆12 · Nov 12, 2025 · Updated 3 months ago
- ☆12 · Jan 29, 2023 · Updated 3 years ago
- It's an open source restaurant and coffee shop management system. ☆10 · Jan 6, 2023 · Updated 3 years ago
- ☆10 · Jan 30, 2023 · Updated 3 years ago
- The official PyTorch implementation of VM-ASR, a model designed for high-fidelity audio super-resolution. ☆19 · Sep 8, 2025 · Updated 5 months ago
- Unstructured.io API GUI ☆11 · Aug 6, 2023 · Updated 2 years ago
- Build TypeScript functions that are durable by default; no PhD required. ☆15 · Apr 3, 2025 · Updated 11 months ago
- Tupi is an open/free 2D animation software based on usability, a friendly user experience and community values. Every human is an artist … ☆10 · Oct 29, 2015 · Updated 10 years ago
- ☆10 · Feb 14, 2021 · Updated 5 years ago
- Puppeteer Recaptcha solver ☆10 · Jan 10, 2024 · Updated 2 years ago
- ☆10 · Jul 20, 2023 · Updated 2 years ago
- A De/CompressionStream for Bun ☆13 · Jul 9, 2025 · Updated 7 months ago
- BH hackathon ☆14 · Apr 4, 2024 · Updated last year
- Python SDK for Modaic ☆23 · Updated this week
- Open source design systems, expressed as DTCG JSON ☆14 · Feb 19, 2026 · Updated 2 weeks ago
- Hardware and software for smartphone sensor peripherals using the audio jack interface. ☆13 · Jan 10, 2014 · Updated 12 years ago
- flutter boilerplate ☆14 · Sep 14, 2021 · Updated 4 years ago
- DEPRECATED: Convert OpenSCAD to JSCAD (See the link below) ☆23 · Dec 30, 2018 · Updated 7 years ago
- An electron Wrapper for Open-Interpreter for the lablab.ai hackathon ☆12 · Oct 14, 2023 · Updated 2 years ago
- Angular google place library ☆15 · Updated this week
- Exploring retrieval systems for language models ☆14 · Apr 12, 2025 · Updated 10 months ago
- DB design & Web interface ☆10 · Jun 11, 2023 · Updated 2 years ago
- Install Wireguard systemlessly ☆11 · Dec 27, 2017 · Updated 8 years ago
- Detect how uv was installed and get upgrade instructions ☆29 · Jul 23, 2025 · Updated 7 months ago
- Resilient multi-LLM orchestration with built-in failure handling, rate limits, retries, and a circuit breaker. ☆29 · Updated this week
- A hackable library for running and fine-tuning modern transformer models on commodity and alternative GPUs, powered by tinygrad. ☆28 · Feb 10, 2026 · Updated 3 weeks ago
- Linux GUI for ThinkPad laptops ☆10 · May 22, 2017 · Updated 8 years ago
- ☆10 · Nov 6, 2024 · Updated last year
- A simple Electron app to wrap around MPV to play VRCDN streams without any buffer. ☆10 · Nov 1, 2023 · Updated 2 years ago