lm-sys / RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆4,221Updated last year
Alternatives and similar repositories for RouteLLM
Users who are interested in RouteLLM are comparing it to the libraries listed below
- Deploy your agentic workflows to production☆2,050Updated last week
- Harness LLMs with Multi-Agent Programming☆3,616Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,734Updated last week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach…☆5,341Updated 5 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,082Updated 2 weeks ago
- The Open Source Memory Layer For Autonomous Agents☆2,299Updated 10 months ago
- Knowledge Agents and Management in the Cloud☆4,111Updated this week
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,101Updated last month
- ☆3,005Updated 11 months ago
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆1,955Updated this week
- Adding guardrails to large language models.☆5,489Updated last month
- Optimizing inference proxy for LLMs☆2,766Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,375Updated 3 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,868Updated 7 months ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,774Updated last week
- Zep | Examples, Integrations, & More☆3,544Updated last week
- A language model programming library.☆5,805Updated 2 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,640Updated 3 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,377Updated this week
- Chat language model that can use tools and interpret the results☆1,578Updated 2 weeks ago
- Desktop app for prototyping and debugging LangGraph applications locally.☆3,149Updated last month
- Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude,…☆7,998Updated last week
- Langtrace 🔍 is an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, ev…☆1,006Updated 3 months ago
- 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓☆4,358Updated this week
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,875Updated last week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,803Updated 7 months ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,579Updated this week
- A toolkit to create an optimal production-ready Retrieval Augmented Generation (RAG) setup for your data☆1,464Updated 3 months ago
- structured outputs for llms☆11,240Updated last week
- Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api☆1,161Updated 2 months ago