lm-sys / RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆3,968Updated 9 months ago
Alternatives and similar repositories for RouteLLM
Users that are interested in RouteLLM are comparing it to the libraries listed below
- Deploy your agentic workflows to production☆2,009Updated this week
- Harness LLMs with Multi-Agent Programming☆3,349Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,473Updated 2 weeks ago
- Optimizing inference proxy for LLMs☆2,427Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,604Updated 3 weeks ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach…☆5,111Updated 2 months ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,671Updated this week
- Knowledge Agents and Management in the Cloud☆3,984Updated last week
- The Open Source Memory Layer For Autonomous Agents☆2,224Updated 7 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,985Updated last week
- Chat language model that can use tools and interpret the results☆1,553Updated 3 weeks ago
- ☆2,951Updated 8 months ago
- A language model programming library.☆5,766Updated 3 months ago
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆1,774Updated last week
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,689Updated 5 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,051Updated last week
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simpl…☆4,266Updated this week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,755Updated 4 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,712Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,073Updated 2 months ago
- Desktop app for prototyping and debugging LangGraph applications locally.☆2,880Updated 2 months ago
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,060Updated 2 weeks ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,190Updated 2 weeks ago
- A system for agentic LLM-powered data processing and ETL☆1,987Updated last week
- The AI-native proxy server for agents. Arch handles the pesky low-level work in building agentic apps like calling specific tools, routin…☆2,641Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,090Updated last week
- The easiest way to use Agentic RAG in any enterprise☆4,234Updated 4 months ago
- structured outputs for llms☆10,603Updated this week
- Zep | The Memory Foundation For Your AI Stack☆3,297Updated last month
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆6,904Updated this week