lm-sys / RouteLLMLinks
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆4,581Updated last year
Alternatives and similar repositories for RouteLLM
Users that are interested in RouteLLM are comparing it to the libraries listed below
Sorting:
- Deploy your agentic worfklows to production☆2,073Updated last week
- Optimizing inference proxy for LLMs☆3,317Updated last week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,812Updated 3 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,849Updated 8 months ago
- Harness LLMs with Multi-Agent Programming☆3,874Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,154Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.☆3,273Updated 2 months ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,924Updated 5 months ago
- Agentic components of the Llama Stack APIs☆4,289Updated 6 months ago
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆2,193Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,705Updated 8 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆2,006Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,074Updated last week
- The Open Source Memory Layer For Autonomous Agents☆2,562Updated last year
- Knowledge Agents and Management in the Cloud☆4,231Updated this week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,852Updated last year
- ☆3,071Updated 2 months ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,823Updated last year
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,525Updated 8 months ago
- ☆1,193Updated last month
- The easiest way to use Agentic RAG in any enterprise☆4,396Updated last year
- Efficient Retrieval Augmentation and Generation Framework☆1,763Updated 3 weeks ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,653Updated this week
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,986Updated 5 months ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆4,010Updated last week
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…☆3,053Updated 9 months ago
- A language model programming library.☆5,878Updated 8 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,592Updated last month
- RAG that intelligently adapts to your use case, data, and queries☆3,687Updated 3 months ago
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,130Updated 3 months ago