lm-sys / RouteLLMLinks

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

☆4,132

Alternatives and similar repositories for RouteLLM

Users that are interested in RouteLLM are comparing it to the libraries listed below

Sorting:

langroid / langroid
Harness LLMs with Multi-Agent Programming
☆3,545Updated last week
run-llama / llama_deploy
Deploy your agentic worfklows to production
☆2,045Updated 2 weeks ago
aurelio-labs / semantic-router
Superfast AI decision making and intelligent processing of multi-modal data.
☆2,691Updated last week
microsoft / LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…
☆5,300Updated 4 months ago
codelion / optillm
Optimizing inference proxy for LLMs
☆2,695Updated this week
AgentOps-AI / tokencost
Easy token price estimates for 400+ LLMs. TokenOps.
☆1,754Updated this week
run-llama / llama_cloud_services
Knowledge Agents and Management in the Cloud
☆4,069Updated last week
AnswerDotAI / RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,587Updated 2 months ago
predibase / lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,333Updated 2 months ago
e2b-dev / code-interpreter
Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app
☆1,898Updated 2 weeks ago
kingjulio8238 / Memary
The Open Source Memory Layer For Autonomous Agents
☆2,287Updated 9 months ago
character-ai / prompt-poet
Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.
☆1,093Updated last week
getzep / zep
Zep | Examples, Integrations, & More
☆3,445Updated this week
SciPhi-AI / R2R
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
☆7,123Updated last month
KruxAI / ragbuilder
A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data
☆1,451Updated 2 months ago
mistralai / mistral-finetune
☆2,990Updated 10 months ago
cohere-ai / cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
☆3,068Updated last week
circlemind-ai / fast-graphrag
RAG that intelligently adapts to your use case, data, and queries
☆3,409Updated last month
SylphAI-Inc / AdalFlow
AdalFlow: The library to build & auto-optimize LLM applications.
☆3,474Updated this week
guardrails-ai / guardrails
Adding guardrails to large language models.
☆5,332Updated 2 weeks ago
truefoundry / cognita
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
☆4,163Updated 5 months ago
567-labs / instructor
structured outputs for llms
☆11,098Updated this week
weaviate / Verba
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
☆7,219Updated 2 weeks ago
langchain-ai / langgraph-studio
Desktop app for prototyping and debugging LangGraph applications locally.
☆3,116Updated last month
Helicone / helicone
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
☆4,234Updated this week
Marker-Inc-Korea / AutoRAG
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
☆4,144Updated 3 weeks ago
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,821Updated this week
Scale3-Labs / langtrace
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev…
☆993Updated 2 months ago
dottxt-ai / outlines
Structured Outputs
☆12,188Updated this week
ag2ai / ag2
AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/pAbnFJrkgZ
☆3,113Updated this week