lm-sys / RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆3,729 Updated 7 months ago
Alternatives and similar repositories for RouteLLM:
Users who are interested in RouteLLM are comparing it to the libraries listed below.
- Deploy your agentic workflows to production ☆1,981 Updated 2 weeks ago
- Harness LLMs with Multi-Agent Programming ☆3,171 Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data. ☆2,473 Updated last week
- RAG that intelligently adapts to your use case, data, and queries ☆3,042 Updated 3 weeks ago
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API. ☆5,735 Updated this week
- The Open Source Memory Layer For Autonomous Agents ☆2,041 Updated 5 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach… ☆4,957 Updated last week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,328 Updated last month
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆3,961 Updated last month
- Desktop app for prototyping and debugging LangGraph applications locally. ☆2,630 Updated last week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models ☆2,703 Updated 2 months ago
- Knowledge Agents and Management in the Cloud ☆3,791 Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps. ☆1,608 Updated this week
- Build and query dynamic, temporally-aware Knowledge Graphs ☆2,478 Updated this week
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach. ☆1,045 Updated this week
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app ☆1,603 Updated this week
- The easiest way to use Agentic RAG in any enterprise ☆4,157 Updated 2 months ago
- Optimizing inference proxy for LLMs ☆2,110 Updated this week
- ☆2,889 Updated 6 months ago
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including O… ☆4,083 Updated this week
- High-performance retrieval engine for unstructured data ☆1,272 Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate ☆6,955 Updated this week
- structured outputs for llms ☆9,860 Updated this week
- AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/pAbnFJrkgZ ☆2,071 Updated this week
- Agentless🐱: an agentless approach to automatically solve software development problems ☆1,572 Updated 3 months ago
- The LLM Evaluation Framework ☆5,681 Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications. ☆2,884 Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆2,802 Updated 2 weeks ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆3,002 Updated last week
- The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease. ☆1,770 Updated last month