Tutorial for building LLM router
☆253 · Jul 19, 2024 · Updated last year
Alternatives and similar repositories for llm-router
Users that are interested in llm-router are comparing it to the libraries listed below.
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆4,976 · Aug 10, 2024 · Updated last year
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆165 · Jun 13, 2024 · Updated last year
- Fine-tune an LLM to perform batch inference and online serving. ☆120 · May 29, 2025 · Updated last year
- A curated list of awesome approaches to AI model routing ☆208 · Mar 24, 2025 · Updated last year
- ☆44 · Dec 14, 2024 · Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆205 · Jul 17, 2024 · Updated last year
- Framework for Cost-Effective Language Model Choice ☆16 · Dec 12, 2023 · Updated 2 years ago
- ☆15 · Mar 29, 2025 · Updated last year
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset ☆30 · Apr 8, 2025 · Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆165 · Oct 19, 2024 · Updated last year
- [NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention… ☆1,219 · Apr 8, 2026 · Updated 2 months ago
- RayLLM - LLMs on Ray (Archived). Read README for more info. ☆1,267 · Mar 13, 2025 · Updated last year
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs ☆302 · May 21, 2026 · Updated 3 weeks ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆70 · Jun 29, 2024 · Updated last year
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl… ☆2,178 · Oct 8, 2024 · Updated last year
- AI_Powered_Dev_Search_Engine ☆12 · Mar 10, 2024 · Updated 2 years ago
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models ☆2,904 · Jan 7, 2025 · Updated last year
- Generate Structured JSON with probs from Language Models ☆17 · Mar 23, 2025 · Updated last year
- Examples and Demos using the Cohere APIs ☆24 · Nov 3, 2023 · Updated 2 years ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasets ☆19 · Jun 16, 2023 · Updated 2 years ago
- ☆31 · Feb 25, 2026 · Updated 3 months ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis ☆11 · Feb 17, 2023 · Updated 3 years ago
- Automated testing and benchmarking for code generation agents. ☆18 · Jun 27, 2023 · Updated 2 years ago
- A modular, agentic-AI-based adaptive cybersecurity architecture for digital ecosystems. Combines Zero Trust, real-time telemetry, and int… ☆22 · Jul 4, 2025 · Updated 11 months ago
- ☆23 · Jul 10, 2023 · Updated 2 years ago
- ☆37 · May 5, 2025 · Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆120 · Apr 27, 2026 · Updated last month
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including C… ☆5,617 · Mar 19, 2026 · Updated 2 months ago
- ☆10 · Oct 31, 2023 · Updated 2 years ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆24 · Jan 7, 2024 · Updated 2 years ago
- Open-source AI for voice control, rivaling Alexa and Siri ☆13 · Mar 9, 2024 · Updated 2 years ago
- ☆210 · Jun 26, 2025 · Updated 11 months ago
- Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads ☆2,748 · Jun 25, 2024 · Updated last year
- A workbench application to test out different prompts on a variety of AI models to see how they perform ☆16 · Feb 9, 2025 · Updated last year
- Tools for merging pretrained large language models. ☆19 · Jun 12, 2024 · Updated last year
- Optimizing inference proxy for LLMs ☆4,135 · May 7, 2026 · Updated last month
- TLS & API keys for your LLM APIs ☆20 · Dec 17, 2025 · Updated 5 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆347 · Jun 16, 2024 · Updated last year
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature. ☆3,596 · Jul 25, 2025 · Updated 10 months ago