pwilkin / llama-runnerLinks

Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends

☆48

Alternatives and similar repositories for llama-runner

Users that are interested in llama-runner are comparing it to the libraries listed below

Sorting:

TesslateAI / TFrameX
☆173Updated 3 months ago
rhulha / Speech2Speech
A web application that converts speech to speech 100% private
☆81Updated 5 months ago
boneylizard / Eloquent
A local front-end for open-weight LLMs with memory, RAG, TTS/STT, Elo ratings, and dynamic research tools. Built with React and FastAPI.
☆38Updated 3 months ago
thushan / olla
High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…
☆117Updated this week
quentin-r37 / sortify-ai
☆56Updated 9 months ago
PkmX / orpheus-chat-webui
Orpheus Chat WebUI
☆75Updated 7 months ago
akashjss / orpheus-tts-local-webui
Run Orpheus 3B Locally with Gradio UI, Standalone App
☆22Updated 7 months ago
savantskie / persistent-ai-memory
A persistent local memory for AI, LLMs, or Copilot in VS Code.
☆170Updated 3 weeks ago
yazon / flexllama
🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…
☆41Updated 2 weeks ago
TAR-ALEX / llm-html
☆19Updated 4 months ago
wsmlby / homl
The easiest & fastest way to run LLMs in your home lab
☆72Updated 2 months ago
KartDriver / mira_converse
☆83Updated 8 months ago
Lanerra / saga
Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.
☆81Updated this week
goodreasonai / nichey
Generate a wiki for your research topic, sourcing from the web and your docs.
☆52Updated 8 months ago
intelligencedev / eternal
Eternal is an experimental platform for machine learning models and workflows.
☆67Updated 8 months ago
ExoFi-Labs / OllamaGTTS
☆190Updated 7 months ago
victorcarre6 / llm-memorization
Give your local LLM a real memory with a lightweight, fully local memory system. 100% offline and under your control.
☆62Updated 2 months ago
dkruyt / webollama
A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model…
☆59Updated last month
k-koehler / gguf-tensor-overrider
☆49Updated last month
PasiKoodaa / ACE-Step-RADIO
ACE-Step: A Step Towards Music Generation Foundation Model
☆45Updated 6 months ago
MehulG / memX
A real-time shared memory layer for multi-agent LLM systems.
☆49Updated 4 months ago
perk11 / large-model-proxy
Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…
☆83Updated 3 weeks ago
thooton / aspen
Personal voice assistant, with voice interruption and Twilio support
☆18Updated 8 months ago
calmstate / polyglot
Polyglot is a fast, elegant, and free translation tool using AI.
☆63Updated last year
matteoserva / GraphLLM
☆208Updated 2 months ago
Mahrkeenerh / lfind
A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.
☆28Updated 8 months ago
TesslateAI / Agent-Builder
☆192Updated 2 months ago
platinum-hill / cobolt
This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support
☆225Updated 3 months ago
TheProxyCompany / proxy-structuring-engine
Guaranteed Structured Output from any Language Model via Hierarchical State Machines
☆145Updated last month
AlgorithmicKing737 / orpheus-tts-local-openai
Run Orpheus 3B Locally With LM Studio
☆31Updated 8 months ago