A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆5,071 · Aug 10, 2024 · Updated last year
Alternatives and similar repositories for RouteLLM
Users that are interested in RouteLLM are comparing it to the libraries listed below.
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ☆51,475 · Updated this week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models ☆2,912 · Jan 7, 2025 · Updated last year
- DSPy: The framework for programming—not prompting—language models ☆35,310 · Jun 18, 2026 · Updated last week
- Platform for stateful agents: AI with advanced memory that can learn and self-improve over time. ☆23,435 · May 14, 2026 · Updated last month
- Universal memory layer for AI Agents ☆59,199 · Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ☆21,725 · Apr 15, 2026 · Updated 2 months ago
- Structured Outputs ☆13,984 · Jun 19, 2026 · Updated last week
- structured outputs for llms ☆13,210 · Updated this week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t… ☆54,270 · Updated this week
- Build, run, and manage agent platforms. ☆40,783 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆83,677 · Updated this week
- A programming framework for agentic AI ☆59,069 · Apr 15, 2026 · Updated 2 months ago
- Deploy your agentic workflows to production ☆2,068 · Apr 6, 2026 · Updated 2 months ago
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally. ☆67,133 · Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆6,320 · Apr 8, 2026 · Updated 2 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,912 · Apr 13, 2026 · Updated 2 months ago
- aider is AI pair programming in your terminal ☆46,496 · May 22, 2026 · Updated last month
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API. ☆7,892 · Nov 7, 2025 · Updated 7 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆29,013 · Sep 30, 2025 · Updated 8 months ago
- LlamaIndex is the leading document agent and OCR platform ☆50,340 · Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data. ☆3,636 · May 23, 2026 · Updated last month
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆33,885 · Updated this week
- Vane is an AI-powered answering engine. ☆35,415 · Apr 11, 2026 · Updated 2 months ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate ☆7,713 · Jun 8, 2026 · Updated 2 weeks ago
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including C… ☆5,649 · Mar 19, 2026 · Updated 3 months ago
- Go ahead and axolotl questions ☆12,082 · Updated this week
- A blazing fast AI Gateway with integrated guardrails. Route to 1,600+ LLMs, 50+ AI Guardrails with 1 fast & friendly API. ☆12,132 · May 25, 2026 · Updated last month
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆13,430 · Jun 18, 2026 · Updated last week
- Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist … ☆11,199 · Dec 12, 2024 · Updated last year
- Supercharge Your LLM Application Evaluations 🚀 ☆14,523 · Feb 24, 2026 · Updated 4 months ago
- PraisonAI 🦞 — Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous self-improving agents that research, plan… ☆8,201 · Updated this week
- 🙌 OpenHands: AI-Driven Development ☆78,051 · Updated this week
- A guidance language for controlling large language models. ☆21,507 · May 21, 2026 · Updated last month
- SGLang is a high-performance serving framework for large language models and multimodal models. ☆29,460 · Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps. ☆1,988 · Sep 5, 2025 · Updated 9 months ago
- Large Action Model framework to develop AI Web Agents ☆6,375 · Jan 21, 2025 · Updated last year
- Tutorial for building LLM router ☆253 · Jul 19, 2024 · Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,938 · May 17, 2025 · Updated last year
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec… ☆19,559 · Jun 17, 2026 · Updated last week