NVIDIA-AI-Blueprints / llm-routerLinks
Route LLM requests to the best model for the task at hand.
☆87Updated last month
Alternatives and similar repositories for llm-router
Users that are interested in llm-router are comparing it to the libraries listed below
Sorting:
- Tutorial for building LLM router☆220Updated last year
- Run AI generated code in isolated sandboxes☆90Updated 5 months ago
- Routing on Random Forest (RoRF)☆181Updated 10 months ago
- Self-host LLMs with vLLM and BentoML☆138Updated last week
- An HTTP service intended as a backend for an LLM that can run arbitrary pieces of Python code.☆65Updated last month
- ☆73Updated 5 months ago
- ☆132Updated 3 months ago
- API Server for Transformer Lab☆69Updated this week
- ☆180Updated 5 months ago
- A Lightweight Library for AI Observability☆249Updated 5 months ago
- ☆160Updated last week
- MCP (Model Context Protocol) server for Weaviate☆139Updated 2 months ago
- Giving Claude ability to run code with E2B via MCP (Model Context Protocol)☆299Updated 3 weeks ago
- Official python implementation of the UTCP☆364Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆438Updated last week
- Verifiers for LLM Reinforcement Learning☆67Updated this week
- Build Research and Rag agents with Granite on your laptop☆139Updated 2 months ago
- Prompt Declaration Language (PDL) is a declarative prompt programming language.☆200Updated this week
- ☆166Updated 5 months ago
- Testing and evaluation framework for voice agents☆129Updated 2 months ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆105Updated 7 months ago
- ☆89Updated 6 months ago
- Unsloth Studio☆98Updated 3 months ago
- Not Diamond Python SDK☆83Updated last month
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆178Updated 3 months ago
- GraphRAG database - hybrid graph / vector db☆127Updated 10 months ago
- Distributed Inference for mlx LLm☆94Updated last year
- A collection of all available inference solutions for the LLMs☆91Updated 5 months ago
- RAGLight is a lightweight and modular Python library for implementing Retrieval-Augmented Generation (RAG), Agentic RAG and RAT (Retrieva…☆43Updated last week
- ☆154Updated last month