anyscale / llm-routerView external linksLinks
Tutorial for building LLM router
☆244Jul 19, 2024Updated last year
Alternatives and similar repositories for llm-router
Users that are interested in llm-router are comparing it to the libraries listed below
Sorting:
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,581Aug 10, 2024Updated last year
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆153Jun 13, 2024Updated last year
- ☆39Dec 14, 2024Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆120May 29, 2025Updated 8 months ago
- ☆17Sep 1, 2024Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆201Jul 17, 2024Updated last year
- Generate Structured JSON with probs from Language Models☆17Mar 23, 2025Updated 10 months ago
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆298Jan 20, 2026Updated 3 weeks ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Apr 9, 2025Updated 10 months ago
- [NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention…☆1,183Sep 30, 2025Updated 4 months ago
- ☆23Jul 10, 2023Updated 2 years ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆143Oct 19, 2024Updated last year
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆42Updated this week
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Jan 7, 2024Updated 2 years ago
- Examples and Demos using the Cohere APIs☆23Nov 3, 2023Updated 2 years ago
- The official Python SDK for UCP☆64Updated this week
- Engineering Blog article prototypes☆17Oct 12, 2025Updated 4 months ago
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated last year
- A modular, agentic-AI-based adaptive cybersecurity architecture for digital ecosystems. Combines Zero Trust, real-time telemetry, and int…☆21Jul 4, 2025Updated 7 months ago
- ☆10Oct 31, 2023Updated 2 years ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Mar 8, 2024Updated last year
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,174Oct 8, 2024Updated last year
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,897Jan 21, 2024Updated 2 years ago
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including C…☆5,282Oct 30, 2025Updated 3 months ago
- ☆53Sep 3, 2024Updated last year
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,852Jan 7, 2025Updated last year
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis☆11Feb 17, 2023Updated 2 years ago
- ☆18Jan 13, 2026Updated last month
- Train text generation model with JavaScript.☆15Jul 14, 2024Updated last year
- ☆10Jul 15, 2024Updated last year
- Open-source AI for voice control, rivaling Alexa and Siri☆13Mar 9, 2024Updated last year
- ☆17May 22, 2025Updated 8 months ago
- Code Repository for the NeurIPS 2022 paper: "Hyper-Representations as Generative Models: Sampling Unseen Neural Network Weights".☆17Jul 10, 2024Updated last year
- code for training and using chess embeddings models☆13Jun 9, 2024Updated last year
- ☆32Jul 12, 2024Updated last year
- ☆31Nov 14, 2024Updated last year
- This repository demonstrates how a simple memory file can be incorporated into an Agents routine using the AutoGen framework.☆50Oct 4, 2023Updated 2 years ago
- A synthetic story narration dataset to study small audio LMs.☆31Jan 21, 2024Updated 2 years ago
- Deploy your agentic worfklows to production☆2,074Jan 28, 2026Updated 2 weeks ago