SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
☆54Updated this week
Alternatives and similar repositories for SmarterRouter
Users who are interested in SmarterRouter are comparing it to the repositories listed below.
- FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a …☆10Jan 29, 2026Updated 3 weeks ago
- Quick access to any large language model from your browser.☆10Feb 16, 2026Updated last week
- Get aid from local LLMs right in your PowerShell☆15May 2, 2025Updated 9 months ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆18Jan 10, 2025Updated last year
- AI Search engine☆13Sep 24, 2025Updated 5 months ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated 11 months ago
- Run Orpheus 3B Locally With LM Studio☆32Mar 20, 2025Updated 11 months ago
- Run Prometheus rootless and distroless☆18Feb 11, 2026Updated 2 weeks ago
- Unofficial Python client for the complete Open WebUI API.☆31Feb 17, 2026Updated last week
- ☆51Oct 1, 2025Updated 4 months ago
- Open WebUI tool — Give your LLM a persistent workspace with file storage, SQLite, archives, and collaboration.☆58Feb 2, 2026Updated 3 weeks ago
- Mikrotik NGINX Reverse Proxy☆27May 1, 2024Updated last year
- Simple HTML Ollama chatbot that is easy to install. Simply copy the HTML file to your computer and run it.☆47Jun 18, 2025Updated 8 months ago
- ☆25Apr 26, 2025Updated 9 months ago
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Jan 16, 2026Updated last month
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Feb 11, 2026Updated 2 weeks ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆36Jul 2, 2025Updated 7 months ago
- ☆73May 19, 2025Updated 9 months ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆50Feb 17, 2026Updated last week
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆154Updated this week
- Mem0 Integration with OpenWebUI☆53Jan 19, 2026Updated last month
- ☆14Updated this week
- Wakeword Installer for Home Assistant☆19Jun 1, 2025Updated 8 months ago
- A free, offline, private AI text-to-speech desktop app built on Rust 🦜☆23Updated this week
- ☆18Nov 17, 2022Updated 3 years ago
- [READ ONLY] Subtree split of the siyuan-packages-monorepo (see https://github.com/Zuoqiu-Yingyi/siyuan-packages-monorepo)☆12Jan 23, 2024Updated 2 years ago
- Human-in-the-loop ad copy generation application implemented with Streamlit and LangGraph☆11Feb 15, 2025Updated last year
- A powerful MCP testing tool with multi-provider LLM support (Ollama, OpenAI, Claude, Gemini). Test, debug, and develop MCP servers with a…☆18Jan 7, 2026Updated last month
- Text to audio with TikTok voices☆13Apr 6, 2023Updated 2 years ago
- ☆17Feb 4, 2026Updated 3 weeks ago
- ☆123Updated this week
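- A plugin that connects personal WeChat to OpenClaw, enabling conversations between a WeChat mini program and OpenClaw. Chat with OpenClaw directly through the ClawChat WeChat mini program, have OpenClaw do what you ask, and receive OpenClaw's replies anytime, anywhere.☆29Updated this week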
- Simple and powerful extension for searching web and viewing website content.☆11Apr 11, 2025Updated 10 months ago
- ☆12Jun 1, 2025Updated 8 months ago
- ☆10Sep 29, 2024Updated last year
- ☆45Dec 1, 2025Updated 2 months ago
- Custom backplane for DIY NAS (4xSATA)☆16May 12, 2024Updated last year
- Home server setup☆13Oct 5, 2025Updated 4 months ago
- Generate random character (PCs or NPCs) backgrounds using the "Central Casting: Heroes of Legend" book☆11Sep 13, 2023Updated 2 years ago