High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.
☆163Feb 27, 2026Updated this week
Alternatives and similar repositories for olla
Users that are interested in olla are comparing it to the libraries listed below
Sorting:
- AI Search engine☆13Sep 24, 2025Updated 5 months ago
- LLm Collaboration☆12Aug 23, 2024Updated last year
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆56Feb 24, 2026Updated last week
- A Terminal User Interface (TUI) application that enables interactive conversations with your documents using Large Language Models (LLM) …☆13Dec 11, 2024Updated last year
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Jan 16, 2026Updated last month
- minimalist system fetch tool in V☆25Jul 30, 2025Updated 7 months ago
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Oct 12, 2025Updated 4 months ago
- FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a …☆10Jan 29, 2026Updated last month
- Professional Wargaming LLM Toolbox☆20Jul 9, 2025Updated 7 months ago
- A simple Speech-to-Text (STT) / Text-to-Speech (TTS) wrapper for LLMs☆11Oct 22, 2024Updated last year
- Quick access to any large language model from your browser.☆10Feb 16, 2026Updated 2 weeks ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Get aid from local LLMs right in your PowerShell☆15May 2, 2025Updated 10 months ago
- RP/Writing focused LLM frontend☆61Updated this week
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆28Dec 29, 2025Updated 2 months ago
- Chat with your PDFs or text files using ChatGPT with a HTMX web UI☆13Jul 19, 2023Updated 2 years ago
- Kick is an AI-powered assistant that provides voice and keyboard control over your Windows device, enabling seamless automation of your d…☆16Jul 29, 2025Updated 7 months ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆18Jan 10, 2025Updated last year
- Controllable Language Model Interactions in TypeScript☆10May 17, 2024Updated last year
- Open source static analysis toolkit for LLM agent plans☆13Aug 9, 2025Updated 6 months ago
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng…☆84Jul 13, 2025Updated 7 months ago
- AN AI based Calorie Tracker Application☆19Oct 25, 2025Updated 4 months ago
- LLamaHTML is a simple html file to communicate with a running llamacpp llama-server☆22Aug 5, 2025Updated 6 months ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆36Jul 2, 2025Updated 8 months ago
- this is a dungeon ai run locally that use your llm in the terminal with multiple players from 2 to 5☆16Jan 25, 2026Updated last month
- A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model…☆71Oct 8, 2025Updated 4 months ago
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆12May 30, 2025Updated 9 months ago
- AI model Prompt Tester (AIPT for short) is a simple app that will check how suitable each model is for a given prompt.☆15Jul 7, 2024Updated last year
- Simple node proxy for llama-server that enables MCP use☆17May 10, 2025Updated 9 months ago
- ☆101Oct 3, 2025Updated 4 months ago
- Awesome LLM speech-to-speech models and frameworks☆40Nov 17, 2025Updated 3 months ago
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Jan 10, 2026Updated last month
- JotItNow is a AI Voice Notes App☆24Mar 6, 2025Updated 11 months ago
- A powerful system for crawling documentation websites, extracting code snippets, and providing fast search capabilities via MCP (Model C…☆27Dec 25, 2025Updated 2 months ago
- WebRAgent is a retrieval-augmented generation (RAG) web application featuring agent-based query decomposition, vector search with Qdrant,…☆54Mar 22, 2025Updated 11 months ago
- TLS & API keys for your LLM APIs☆20Dec 17, 2025Updated 2 months ago
- ☆21Aug 22, 2024Updated last year
- CrewAI template for Autonomeee agnet.☆18Oct 1, 2024Updated last year
- A Multi-Agentic AI Assistant/Builder☆25Jan 23, 2026Updated last month