lm-sys / RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆3,635Updated 6 months ago
Alternatives and similar repositories for RouteLLM:
Users that are interested in RouteLLM are comparing it to the libraries listed below
- Deploy your agentic worfklows to production☆1,964Updated this week
- The most advanced AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆4,880Updated this week
- The Open Source Memory Layer For Autonomous Agents☆2,000Updated 3 months ago
- Harness LLMs with Multi-Agent Programming☆3,066Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆4,879Updated 3 weeks ago
- Knowledge Agents and Management in the Cloud☆3,707Updated this week
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including C…☆2,840Updated this week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,672Updated last month
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,396Updated this week
- Optimizing inference proxy for LLMs☆2,040Updated this week
- structured outputs for llms☆9,428Updated this week
- Desktop app for prototyping and debugging LangGraph applications locally.☆2,479Updated 3 weeks ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,455Updated last month
- The official Python SDK for Model Context Protocol servers and clients☆1,888Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,448Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,565Updated this week
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆1,485Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆3,607Updated last week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,266Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆2,959Updated this week
- Chat language model that can use tools and interpret the results☆1,518Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,362Updated last week
- Build and query dynamic, temporally-aware Knowledge Graphs☆1,915Updated last week
- Zep | The Memory Foundation For Your AI Stack☆2,997Updated 2 months ago
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simpl…☆3,446Updated last week
- A language model programming library.☆5,614Updated this week
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library☆2,599Updated this week
- 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓☆3,215Updated this week
- ☆2,852Updated 5 months ago
- Agentic components of the Llama Stack APIs☆4,140Updated this week