NVIDIA-AI-Blueprints / llm-routerLinks
Route LLM requests to the best model for the task at hand.
☆125Updated 2 weeks ago
Alternatives and similar repositories for llm-router
Users that are interested in llm-router are comparing it to the libraries listed below
Sorting:
- ☆79Updated last month
- Self-host LLMs with vLLM and BentoML☆156Updated 3 weeks ago
- Tutorial for building LLM router☆235Updated last year
- An HTTP service intended as a backend for an LLM that can run arbitrary pieces of Python code.☆68Updated 2 months ago
- Run AI generated code in isolated sandboxes☆120Updated 9 months ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆188Updated 6 months ago
- ☆144Updated 3 months ago
- A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating System…☆138Updated 6 months ago
- MCP (Model Context Protocol) server for Weaviate☆157Updated 5 months ago
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.☆89Updated last week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆700Updated this week
- ☆268Updated 4 months ago
- ☆104Updated last week
- AI Assistant running within your browser.☆75Updated 11 months ago
- ☆234Updated 4 months ago
- Agent computer interface for AI software engineer.☆113Updated 2 months ago
- ☆267Updated this week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆447Updated 2 months ago
- ☆216Updated 3 weeks ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆259Updated this week
- Routing on Random Forest (RoRF)☆220Updated last year
- ScalarLM - a unified training and inference stack☆93Updated last week
- A collection of all available inference solutions for the LLMs☆91Updated 8 months ago
- A Text-Based Environment for Interactive Debugging☆276Updated this week
- Benchmark and optimize LLM inference across frameworks with ease☆131Updated 2 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆78Updated last year
- Prompt Declaration Language (PDL) is a declarative prompt programming language.☆260Updated this week
- Build Research and Rag agents with Granite on your laptop☆146Updated last month
- Sample code and application showcases to get you going with AG2 (formally AutoGen)☆190Updated 2 weeks ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆248Updated last month