lm-sys / RouteLLMLinks
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆4,083Updated 11 months ago
Alternatives and similar repositories for RouteLLM
Users that are interested in RouteLLM are comparing it to the libraries listed below
Sorting:
- Deploy your agentic worfklows to production☆2,035Updated last week
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,671Updated this week
- Knowledge Agents and Management in the Cloud☆4,046Updated this week
- Harness LLMs with Multi-Agent Programming☆3,462Updated this week
- Optimizing inference proxy for LLMs☆2,605Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,262Updated 4 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,560Updated last month
- ☆2,981Updated 9 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,263Updated last month
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,772Updated 6 months ago
- The Open Source Memory Layer For Autonomous Agents☆2,275Updated 8 months ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,728Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆3,353Updated 2 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,796Updated last week
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,781Updated 6 months ago
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,033Updated last week
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,085Updated 2 weeks ago
- Desktop app for prototyping and debugging LangGraph applications locally.☆3,057Updated 2 weeks ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,394Updated this week
- The LLM Evaluation Framework☆9,001Updated this week
- Chat language model that can use tools and interpret the results☆1,569Updated 3 weeks ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,441Updated last month
- Supercharge Your LLM Application Evaluations 🚀☆9,864Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,137Updated 4 months ago
- Zep | Examples, Integrations, & More☆3,344Updated last week
- A language model programming library.☆5,798Updated last month
- ☆1,857Updated last week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,298Updated last week
- Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev…☆969Updated 2 months ago
- ☆908Updated 9 months ago