NVIDIA-AI-Blueprints / llm-router
Route LLM requests to the best model for the task at hand.
☆137 Updated 3 weeks ago
Alternatives and similar repositories for llm-router
Users who are interested in llm-router are comparing it to the libraries listed below.
- Self-host LLMs with vLLM and BentoML ☆161 Updated last week
- Tutorial for building LLM router ☆236 Updated last year
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench ☆192 Updated 7 months ago
- ☆79 Updated 2 months ago
- A collection of all available inference solutions for the LLMs ☆93 Updated 9 months ago
- ☆120 Updated last week
- ☆111 Updated 2 weeks ago
- Benchmark and optimize LLM inference across frameworks with ease ☆141 Updated 2 months ago
- ☆268 Updated last week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs ☆730 Updated this week
- ☆234 Updated last week
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows. ☆289 Updated this week
- A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating System… ☆139 Updated this week
- Run AI generated code in isolated sandboxes ☆126 Updated 10 months ago
- Verifiers for LLM Reinforcement Learning ☆79 Updated 2 months ago
- ScalarLM - a unified training and inference stack ☆93 Updated 2 weeks ago
- ☆182 Updated 9 months ago
- Own your AI, search the web with it🌐😎 ☆92 Updated 10 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆278 Updated last month
- ☆266 Updated 5 months ago
- ☆176 Updated this week
- Agent computer interface for AI software engineer. ☆114 Updated 2 months ago
- Routing on Random Forest (RoRF) ☆226 Updated last year
- GraphRAG database - hybrid graph / vector db ☆134 Updated last year
- Ranking LLMs on agentic tasks ☆200 Updated 2 weeks ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆225 Updated this week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆450 Updated 3 months ago
- MCP (Model Context Protocol) server for Weaviate ☆159 Updated 6 months ago
- An HTTP service intended as a backend for an LLM that can run arbitrary pieces of Python code. ☆68 Updated 2 months ago
- Sample code and application showcases to get you going with AG2 (formerly AutoGen) ☆195 Updated last month