NVIDIA-AI-Blueprints / llm-routerLinks

Route LLM requests to the best model for the task at hand.

☆87

Alternatives and similar repositories for llm-router

Users that are interested in llm-router are comparing it to the libraries listed below

Sorting:

anyscale / llm-router
Tutorial for building LLM router
☆220Updated last year
substratusai / sandboxai
Run AI generated code in isolated sandboxes
☆90Updated 5 months ago
Not-Diamond / RoRF
Routing on Random Forest (RoRF)
☆181Updated 10 months ago
bentoml / BentoVLLM
Self-host LLMs with vLLM and BentoML
☆138Updated last week
i-am-bee / beeai-code-interpreter
An HTTP service intended as a backend for an LLM that can run arbitrary pieces of Python code.
☆65Updated last month
LLMSELECTOR / LLMSELECTOR
☆73Updated 5 months ago
remichu-ai / gallama
☆132Updated 3 months ago
transformerlab / transformerlab-api
API Server for Transformer Lab
☆69Updated this week
philschmid / mcp-openai-gemini-llama-example
☆180Updated 5 months ago
cfahlgren1 / observers
A Lightweight Library for AI Observability
☆249Updated 5 months ago
langchain-ai / langchain-nvidia
☆160Updated last week
weaviate / mcp-server-weaviate
MCP (Model Context Protocol) server for Weaviate
☆139Updated 2 months ago
e2b-dev / mcp-server
Giving Claude ability to run code with E2B via MCP (Model Context Protocol)
☆299Updated 3 weeks ago
universal-tool-calling-protocol / python-utcp
Official python implementation of the UTCP
☆364Updated this week
vllm-project / guidellm
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
☆438Updated last week
menloresearch / verifiers-deepresearch
Verifiers for LLM Reinforcement Learning
☆67Updated this week
ibm-granite-community / granite-retrieval-agent
Build Research and Rag agents with Granite on your laptop
☆139Updated 2 months ago
IBM / prompt-declaration-language
Prompt Declaration Language (PDL) is a declarative prompt programming language.
☆200Updated this week
agora-protocol / paper-demo
☆166Updated 5 months ago
saharmor / voice-lab
Testing and evaluation framework for voice agents
☆129Updated 2 months ago
kolenaIO / autoarena
Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation
☆105Updated 7 months ago
tom-doerr / dspy_reasoning
☆89Updated 6 months ago
unslothai / unsloth-studio
Unsloth Studio
☆98Updated 3 months ago
Not-Diamond / notdiamond-python
Not Diamond Python SDK
☆83Updated last month
NVIDIA / nim-anywhere
Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench
☆178Updated 3 months ago
bradAGI / GraphMemory
GraphRAG database - hybrid graph / vector db
☆127Updated 10 months ago
mzbac / mlx_sharding
Distributed Inference for mlx LLm
☆94Updated last year
mani-kantap / llm-inference-solutions
A collection of all available inference solutions for the LLMs
☆91Updated 5 months ago
Bessouat40 / RAGLight
RAGLight is a lightweight and modular Python library for implementing Retrieval-Augmented Generation (RAG), Agentic RAG and RAT (Retrieva…
☆43Updated last week
QuixiAI / agi-memory
☆154Updated last month