pwilkin / llama-runnerLinks
Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends
☆50Updated 5 months ago
Alternatives and similar repositories for llama-runner
Users that are interested in llama-runner are comparing it to the libraries listed below
Sorting:
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆131Updated last week
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆51Updated this week
- ☆19Updated 6 months ago
- Orpheus Chat WebUI☆75Updated 9 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆22Updated 9 months ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆48Updated 2 months ago
- A web application that converts speech to speech 100% private☆82Updated 7 months ago
- A real-time shared memory layer for multi-agent LLM systems.☆52Updated 2 weeks ago
- The easiest & fastest way to run LLMs in your home lab☆78Updated last month
- Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language.☆56Updated 2 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 8 months ago
- Give your local LLM a real memory with a lightweight, fully local memory system. 100% offline and under your control.☆66Updated 4 months ago
- A persistent local memory for AI, LLMs, or Copilot in VS Code.☆187Updated 2 months ago
- Generate Your Own Private Morning Radio for Commute☆32Updated 11 months ago
- ☆178Updated 5 months ago
- fully local, temporally aware natural language file search on your pc! even without a GPU. find relevant files using natural language i…☆165Updated last month
- Crow is a Desktop AI Assistant☆32Updated last year
- ☆83Updated 10 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆88Updated last week
- ☆58Updated 11 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆47Updated 4 months ago
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆33Updated 11 months ago
- A novel media player that allows you to navigate by speaker☆85Updated last month
- ☆54Updated 7 months ago
- Fast local speech-to-text for any app using faster-whisper☆145Updated 4 months ago
- Dashboard v5 Coming Soon!!☆63Updated 3 weeks ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year
- Running Microsoft's BitNet via Electron, React & Astro☆50Updated 4 months ago
- Simple node proxy for llama-server that enables MCP use☆16Updated 8 months ago
- A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model…☆63Updated 3 months ago