Route LLM requests to the best model for the task at hand.
☆293May 7, 2026Updated last month
Alternatives and similar repositories for llm-router
Users that are interested in llm-router are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Customizable, AI-driven virtual assistant designed to streamline customer service operations, handle common inquiries, and improve overal…☆268Jun 1, 2026Updated 2 weeks ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆13Mar 7, 2024Updated 2 years ago
- This repo provides some examples of how to build and consume App Actions on Windows.☆20Feb 11, 2026Updated 4 months ago
- ☆32Jun 22, 2025Updated 11 months ago
- SonicBOOM Spectre Attacks☆11Jul 18, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆24Feb 20, 2024Updated 2 years ago
- Pipecat framework based orchestrator for building real-time, voice-enabled, and multimodal conversational AI agents☆57Mar 3, 2026Updated 3 months ago
- A curated list of awesome approaches to AI model routing☆209Mar 24, 2025Updated last year
- ☆41May 2, 2025Updated last year
- ☆172Apr 16, 2026Updated 2 months ago
- MCP server for intelligent code search: semantic (RAG), symbolic (tree-sitter), and regex (ripgrep) search modes. Built for Claude Code a…☆16Sep 12, 2025Updated 9 months ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- Automated LLM Coding Tournaments. There can be only one (winning code solution from the competing AIs)☆52Apr 15, 2026Updated 2 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆131Mar 28, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Secure Inference Resilient Against Malicious Clients☆14May 3, 2022Updated 4 years ago
- Offline-first, decentralized graph database of collaborative Web apps☆15May 12, 2017Updated 9 years ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆5,041Aug 10, 2024Updated last year
- A centralized WebSocket Twitch Chat interface☆14Jan 12, 2023Updated 3 years ago
- This is a Streamlit-based application designed to revolutionize the real estate search process with the power of AI. Utilizing Qdrant for…☆12Feb 14, 2024Updated 2 years ago
- Tutorial for building LLM router☆253Jul 19, 2024Updated last year
- CVPR 2024, Hybrid Functional Maps for Crease-Aware Non-Isometric Shape Matching☆22Mar 25, 2025Updated last year
- ☆22Sep 6, 2025Updated 9 months ago
- OrcBench: A Representative Serverless Benchmark☆14Nov 23, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official Implementation of Knowledge Flow Prompting☆35Oct 20, 2025Updated 7 months ago
- YunoHost DynDNS Server☆13May 23, 2026Updated 3 weeks ago
- Transform Unstructured Data into Synthetic Datasets☆26Sep 3, 2024Updated last year
- Ultra-fast, customizable AI voice dictation in any active app on Windows (MacOS and Linux coming soon)☆37Mar 8, 2026Updated 3 months ago
- A Datacenter Scale Distributed Inference Serving Framework☆7,248Updated this week
- Integrate Github Action with Amazon DynamoDB☆15Jun 11, 2026Updated last week
- Multi-agent system for booking appointments and generating PDF invoices☆14Jul 16, 2025Updated 11 months ago
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated last year
- A GPU Accelerated Binary Vector Store☆47Feb 17, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICLR2023] NTK-SAP: Improving neural network pruning by aligning training dynamics☆20May 1, 2023Updated 3 years ago
- ☆15Jun 8, 2026Updated last week
- Volcengine Object Storage(TOS) JavaScript SDK☆12Apr 7, 2026Updated 2 months ago
- Benchmarks for Business Document Foundation Models☆10Apr 4, 2024Updated 2 years ago
- TaskVanguard - LLM / AI Wrapper for TaskWarrior via API (OpenAI, Deepseek etc.)☆43Feb 7, 2026Updated 4 months ago
- Benchmarks to capture important workloads.☆33Apr 1, 2026Updated 2 months ago
- rFaaS: a high-performance FaaS platform with RDMA acceleration for low-latency invocations.☆59Mar 26, 2026Updated 2 months ago