High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.
☆172Mar 13, 2026Updated last week
Alternatives and similar repositories for olla
Users that are interested in olla are comparing it to the libraries listed below.
- LLM Collaboration☆12Aug 23, 2024Updated last year
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆57Feb 24, 2026Updated 3 weeks ago
- Awesome LLM speech-to-speech models and frameworks☆44Nov 17, 2025Updated 4 months ago
- Professional Wargaming LLM Toolbox☆21Jul 9, 2025Updated 8 months ago
- Install Commands for XMRig on Raspberry Pi - How to Install XMRig on Raspberry Pi☆23Jan 6, 2025Updated last year
- Model Context Protocol management suite/factory. An MCP that can generate and manage other local MCPs in multiple languages. Uses the off…☆39Aug 29, 2025Updated 6 months ago
- Sen Chat is a browser extension that streamlines your online experience by integrating AI chat, advanced web search, document interaction…☆41Dec 22, 2025Updated 3 months ago
- minimalist system fetch tool in V☆25Jul 30, 2025Updated 7 months ago
- ☆100Oct 3, 2025Updated 5 months ago
- Kick is an AI-powered assistant that provides voice and keyboard control over your Windows device, enabling seamless automation of your d…☆16Jul 29, 2025Updated 7 months ago
- AI model Prompt Tester (AIPT for short) is a simple app that will check how suitable each model is for a given prompt.☆15Jul 7, 2024Updated last year
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Jan 16, 2026Updated 2 months ago
- JotItNow is an AI Voice Notes App☆25Mar 6, 2025Updated last year
- A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model…☆72Oct 8, 2025Updated 5 months ago
- LLamaHTML is a simple HTML file to communicate with a running llama.cpp llama-server☆22Aug 5, 2025Updated 7 months ago
- Deploying a full-stack on-prem deep research agent that can be run entirely on a local machine for $0!☆33Nov 8, 2025Updated 4 months ago
- Runtime intelligence system that makes MCP servers debuggable, testable, and safe to run in production.☆44Feb 17, 2026Updated last month
- A guide and example project for setting up an open Makefile-based embedded development toolchain☆25Sep 4, 2018Updated 7 years ago
- DockaShell is an MCP server that gives AI agents isolated Docker containers to work in. MCP tools for shell access, file operations, and …☆29Jun 6, 2025Updated 9 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated 2 months ago
- ☆13May 25, 2023Updated 2 years ago
- A locally run dungeon AI that uses your LLM in the terminal, for 2 to 5 players☆16Jan 25, 2026Updated last month
- Advanced Character tutorial with sea3d and three.js☆22Feb 1, 2019Updated 7 years ago
- A proxy that hosts multiple single-model runners such as llama.cpp and vLLM☆13May 30, 2025Updated 9 months ago
- A powerful system for crawling documentation websites, extracting code snippets, and providing fast search capabilities via MCP (Model C…☆27Dec 25, 2025Updated 2 months ago
- A blueprint for next-gen AI. Project Infinity uses a token-efficient, Codified Agent Protocol to create specialized, secure, and imaginat…☆26Mar 13, 2026Updated last week
- Opinionated Go Project Template☆13Updated this week
- A simple Speech-to-Text (STT) / Text-to-Speech (TTS) wrapper for LLMs☆11Oct 22, 2024Updated last year
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Jan 10, 2026Updated 2 months ago
- A c_import macro for Rust☆14Apr 21, 2025Updated 11 months ago
- AI-powered text compression library for RAG systems and API calls. Reduce token usage by up to 50-60% while preserving semantic meaning w…☆80Aug 16, 2025Updated 7 months ago
- jQuery, React and Streamlit applications written by LLMs☆16Dec 24, 2023Updated 2 years ago
- Open source static analysis toolkit for LLM agent plans☆13Aug 9, 2025Updated 7 months ago
- 📺 TVx — the warmth of modern nostalgia This is the way - television you remember feeling: present, unhurried, *analog*☆34Dec 15, 2025Updated 3 months ago
- AI-powered tool to organize photos into Instagram-ready posts with smart captions and hashtags. Supports both Ollama (local) and Gemini (…☆65Sep 19, 2025Updated 6 months ago
- Description of how to get ITM trace running on the STM32F4 family with JLink/JTrace/STLink adapters.☆13Aug 18, 2020Updated 5 years ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆35Jan 18, 2026Updated 2 months ago
- Natural-sounding Text-to-Speech App that fits anywhere. Fast, Real-Time and flexible.☆56Mar 6, 2026Updated 2 weeks ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆18Jan 10, 2025Updated last year