llama-swap + a minimal ollama compatible api
β57Mar 14, 2026Updated last month
Alternatives and similar repositories for llama-swappo
Users that are interested in llama-swappo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLMβ13May 30, 2025Updated 10 months ago
- π FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GPβ¦β56Mar 5, 2026Updated last month
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.β19Jan 10, 2025Updated last year
- β13Jun 18, 2024Updated last year
- β20Jul 4, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Qwen Code OpenAI Wrapper with Cloudflare Workersβ56Apr 6, 2026Updated 2 weeks ago
- Crashbench is a LLM benchmark to measure bug-finding and reporting capabilities of LLMsβ14Mar 8, 2026Updated last month
- Windows CLI clipping toolβ24Dec 29, 2022Updated 3 years ago
- Simple node proxy for llama-server that enables MCP use