lm-sys / RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆4,553Updated last year
Alternatives and similar repositories for RouteLLM
Users who are interested in RouteLLM are comparing it to the libraries listed below.
- Deploy your agentic workflows to production☆2,071Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.☆3,210Updated 2 months ago
- Optimizing inference proxy for LLMs☆3,288Updated last month
- Harness LLMs with Multi-Agent Programming☆3,847Updated last week
- Knowledge Agents and Management in the Cloud☆4,229Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach…☆5,788Updated 2 months ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,912Updated 4 months ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,998Updated 3 weeks ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,827Updated 8 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,068Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,684Updated 8 months ago
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,128Updated 3 months ago
- The Open Source Memory Layer For Autonomous Agents☆2,560Updated last year
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆2,178Updated 2 weeks ago
- ☆3,069Updated 2 months ago
- A toolkit to create an optimal production-ready Retrieval Augmented Generation (RAG) setup for your data☆1,520Updated 8 months ago
- Langtrace 🔍 is an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, ev…☆1,127Updated 2 months ago
- The easiest way to use Agentic RAG in any enterprise☆4,393Updated last year
- RAG that intelligently adapts to your use case, data, and queries☆3,676Updated 2 months ago
- Structured Outputs☆13,282Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,156Updated this week
- Structured outputs for LLMs☆12,185Updated last week
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,889Updated last week
- ☆1,186Updated last month
- Agentless🐱: an agentless approach to automatically solve software development problems☆2,003Updated last year
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,979Updated 5 months ago
- Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali☆2,629Updated last month
- Chat language model that can use tools and interpret the results☆1,592Updated last month
- Build. Observe. Iterate. Ship.☆1,347Updated this week
- High-performance retrieval engine for unstructured data☆1,551Updated 2 months ago