LLMRouter: An Open-Source Library for LLM Routing
☆1,633Mar 17, 2026Updated 3 weeks ago
Alternatives and similar repositories for LLMRouter
Users that are interested in LLMRouter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆146Jan 21, 2026Updated 2 months ago
- ☆29Jan 11, 2026Updated 3 months ago
- Production-ready Python library for multi-provider LLM orchestration☆41Oct 10, 2025Updated 6 months ago
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆658Mar 21, 2026Updated 3 weeks ago
- Supercharge Your LLM with the Fastest KV Cache Layer☆7,969Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICML 2025]"Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆33Sep 20, 2025Updated 6 months ago
- Claude Code for CUDA. Free AI assistant that actually understands GPU architecture☆104Oct 10, 2025Updated 6 months ago
- Knowledge Engine for AI Agent Memory in 6 lines of code☆15,206Updated this week
- [ICLR 2025] Simulating Human-like Daily Activities with Desire-driven Autonomy☆23Jan 4, 2026Updated 3 months ago
- [ArXiv 2025] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models☆136Dec 25, 2025Updated 3 months ago
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov…☆639Mar 3, 2026Updated last month
- Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama s…☆71Dec 22, 2025Updated 3 months ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆201Updated this week
- [MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on …☆10,743Apr 4, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Build Real-Time Knowledge Graphs for AI Agents☆24,798Updated this week
- PDLP algorithm for linear programming☆94Dec 31, 2025Updated 3 months ago
- Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing s…☆6,287Updated this week
- Deep research agents using MiniMax M2.1 interleaved thinking☆205Dec 23, 2025Updated 3 months ago
- SGLang is a high-performance serving framework for large language models and multimodal models.☆25,643Updated this week
- Bootstraps a fresh Ubuntu VPS into a complete multi-agent AI development environment in 30 minutes: coding agents, session management, sa…☆1,374Updated this week
- A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when bui…☆14,875Mar 22, 2026Updated 3 weeks ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,773Aug 10, 2024Updated last year
- Agent Skills as a Memory Layer☆3,307Updated this week
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- CodeQUEST is a generalizable framework which leverages LLMs to iteratively evaluate and enhance code quality across multiple dimensions f…☆17Feb 11, 2026Updated 2 months ago
- The absolute trainer to light up AI agents.☆16,648Apr 3, 2026Updated last week
- ☆72Jan 18, 2026Updated 2 months ago
- Implementation of the MetaController proposed in "Emergent temporal abstractions in autoregressive models enable hierarchical reinforceme…☆96Mar 31, 2026Updated 2 weeks ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,422Nov 13, 2025Updated 5 months ago
- llama.cpp fork with TQ3_1S/4S CUDA kernels — 3.5-bit WHT quantization achieving Q4s quality at 10% smaller size. Based on RaBitQ-inspired…☆78Updated this week
- Curated plugin marketplace for AI agents - works with Claude Code, Codex, and openskills☆955Mar 22, 2026Updated 3 weeks ago
- Universal memory layer for AI Agents☆52,987Updated this week
- ☆23Dec 30, 2025Updated 3 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A TypeScript Model Context Protocol (MCP) server to allow LLMs to programmatically construct mind maps to explore an idea space, with enf…☆25Mar 23, 2025Updated last year
- TOON as DSPy adapter☆25Feb 1, 2026Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆76,536Updated this week
- The most accurate document search and store for building AI apps☆3,568Apr 2, 2026Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆42,652Updated this week
- The LLM Evaluation Framework☆14,728Updated this week
- ☆17Apr 7, 2025Updated last year