lm-sys / RouteLLMLinks
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆4,309Updated last year
Alternatives and similar repositories for RouteLLM
Users that are interested in RouteLLM are comparing it to the libraries listed below
Sorting:
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,822Updated this week
- Deploy your agentic worfklows to production☆2,052Updated last month
- Knowledge Agents and Management in the Cloud☆4,163Updated this week
- Harness LLMs with Multi-Agent Programming☆3,720Updated last week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,466Updated 6 months ago
- The Open Source Memory Layer For Autonomous Agents☆2,487Updated 11 months ago
- Optimizing inference proxy for LLMs☆2,951Updated last week
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,810Updated last month
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,451Updated 4 months ago
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆2,005Updated this week
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including C…☆4,936Updated 3 weeks ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,924Updated 9 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,119Updated last week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,895Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,690Updated 4 months ago
- ☆3,028Updated last year
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,113Updated 2 months ago
- RAG that intelligently adapts to your use case, data, and queries☆3,535Updated 3 months ago
- structured outputs for llms☆11,555Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,788Updated last week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,506Updated 4 months ago
- The easiest way to use Agentic RAG in any enterprise☆4,335Updated 8 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,486Updated this week
- A language model programming library.☆5,844Updated 4 months ago
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,691Updated last year
- High-performance retrieval engine for unstructured data☆1,501Updated 2 months ago
- [ICLR 2025] Automated Design of Agentic Systems☆1,428Updated 8 months ago
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.☆2,978Updated 2 months ago
- Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api☆1,175Updated 4 months ago
- The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.☆3,212Updated this week