lm-sys / RouteLLMLinks
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆4,502Updated last year
Alternatives and similar repositories for RouteLLM
Users that are interested in RouteLLM are comparing it to the libraries listed below
Sorting:
- Deploy your agentic worfklows to production☆2,067Updated 3 weeks ago
- Superfast AI decision making and intelligent processing of multi-modal data.☆3,141Updated last month
- Optimizing inference proxy for LLMs☆3,252Updated last week
- Harness LLMs with Multi-Agent Programming☆3,820Updated this week
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,129Updated 2 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,736Updated 2 months ago
- Knowledge Agents and Management in the Cloud☆4,227Updated 3 weeks ago
- A language model programming library.☆5,869Updated 7 months ago
- RAG that intelligently adapts to your use case, data, and queries☆3,648Updated 2 months ago
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆2,155Updated last week
- The Open Source Memory Layer For Autonomous Agents☆2,525Updated last year
- Zep | Examples, Integrations, & More☆3,905Updated 2 weeks ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,877Updated 4 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,805Updated 7 months ago
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,843Updated last year
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,994Updated last year
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,522Updated 7 months ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,958Updated this week
- [ICLR 2025] Automated Design of Agentic Systems☆1,480Updated 11 months ago
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆4,220Updated last week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,015Updated 2 weeks ago
- High-performance retrieval engine for unstructured data☆1,545Updated last month
- ☆3,056Updated last month
- structured outputs for llms☆12,065Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,652Updated 7 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,586Updated 2 weeks ago
- Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev…☆1,086Updated last month
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,155Updated this week
- ☆1,166Updated 2 weeks ago
- Adding guardrails to large language models.☆6,216Updated last week