MilkThink-Lab/Awesome-Routing-LLMs

A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)

☆154

Alternatives and similar repositories for Awesome-Routing-LLMs

Users who are interested in Awesome-Routing-LLMs are comparing it to the repositories listed below.

MilkThink-Lab / RouterEval
A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models
☆121 · Jun 3, 2026 · Updated last month
ynulihao / LLMRouterBench
[Findings@ACL'26] LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing
☆83 · Apr 6, 2026 · Updated 3 months ago
ZhangYiqun018 / Avengers
[AAAI 2026] The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants
☆46 · Dec 11, 2025 · Updated 7 months ago
richardzhuang0412 / EmbedLLM
Repo for EmbedLLM: Learning Compact Representations of Large Language Models
☆32 · Sep 25, 2025 · Updated 9 months ago
MilkThink-Lab / MiniLongBench
[ACL 25] The Low-cost Long Context Understanding Benchmark for Large Language Models (Outstanding Paper Award)
☆23 · Jul 30, 2025 · Updated 11 months ago
yanweiyue / masrouter
☆131 · Oct 29, 2025 · Updated 8 months ago
ulab-uiuc / Router-R1
[NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
☆146 · Dec 30, 2025 · Updated 6 months ago
shuhao02 / RouterDC
The code of RouterDC
☆76 · Apr 14, 2025 · Updated last year
ulab-uiuc / GraphRouter
[ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You
☆74 · Dec 30, 2025 · Updated 6 months ago
ZhangYiqun018 / AvengersPro
[DAI 2025] Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing
☆220 · Dec 11, 2025 · Updated 7 months ago
junchenzhi / Awesome-LLM-Ensemble
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
☆252 · Updated this week
linggm3 / 2022_Shenzhen-Cup-Mathematical-Modeling-Challenge
2022 Shenzhen Cup Mathematical Modeling Challenge, national second place, Problem B: a distribution network planning model based on power supply reliability (code + paper + defense slides)
☆20 · Aug 2, 2025 · Updated 11 months ago
RouteWorks / RouterArena
RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderb…
☆113 · Jul 14, 2026 · Updated last week
JL-Cheng / SERE
[ICLR 2026] SERE: Similarity-Based Expert Re-routing for Efficient Batch Decoding in MoE Models
☆18 · Feb 4, 2026 · Updated 5 months ago
lalaliat / Agent-Oriented-Planning
☆26 · Feb 28, 2025 · Updated last year
iNLP-Lab / reading-group
☆18 · Jun 17, 2026 · Updated last month
EIT-NLP / BLEUless_DocMT
☆14 · Nov 19, 2024 · Updated last year
JeffreyYou / Self_Website
⚛ My self website built with react.js
☆26 · Feb 22, 2024 · Updated 2 years ago
zou-group / metatextgrad
metaTextGrad: Automatically optimizing language model optimizers. Published in NeurIPS 2025.
☆15 · Nov 5, 2025 · Updated 8 months ago
linggm3 / 2023_CUMCM_National-First-Prize
2023 China Undergraduate Mathematical Contest in Modeling (CUMCM), national first prize, Problem A: a heliostat field optimization design model (code + paper + defense slides)
☆40 · Aug 2, 2025 · Updated 11 months ago
ypw0102 / BatchEval
code for ACL2024-main: BatchEval: Towards Human-like Text Evaluation
☆19 · May 20, 2024 · Updated 2 years ago
michaelzhiluo / starburst
Burstable Cloud Scheduler
☆17 · Jun 6, 2024 · Updated 2 years ago
George-Ling3 / BLUE
BLUE: Toward Better Language Use in Efficient Vision-Language-Action Models for Autonomous Driving
☆77 · Jun 16, 2026 · Updated last month
EIT-NLP / Connector-Selection-for-MLLM
[EMNLP 2024 Main] Official implementation of the paper "To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimoda…
☆17 · Dec 13, 2024 · Updated last year
shuyuehu / anti-laodeng
anti-laodeng: a Feishu bot that pushes back against "laodeng" (patronizing old-timer) attitudes. Refuse to wear yourself down, start with yourself, and rid the workplace of that attitude.
☆29 · Apr 14, 2026 · Updated 3 months ago
ulab-uiuc / LLMRouter
LLMRouter: An Open-Source Library for LLM Routing
☆2,152 · Jul 13, 2026 · Updated last week
Jianguo99 / Awesome-Diffusion-LLM
A Collection of Papers on Diffusion Large Language Models
☆49 · May 12, 2026 · Updated 2 months ago
Zengwh02 / GlimpRouter
GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts
☆16 · Apr 24, 2026 · Updated 2 months ago
bangx7 / code_aesthetics
Official repository for paper: Code Aesthetics with Agentic Reward Feedback
☆17 · Jan 27, 2026 · Updated 5 months ago
ViktorAxelsen / BudgetMem
[ICML'26] Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
☆21 · Jun 10, 2026 · Updated last month
ynulihao / SP2000
Catalogue of Life toolkit for Python
☆11 · Aug 4, 2020 · Updated 5 years ago
yanweiyue / AgentPrune
☆137 · Mar 23, 2025 · Updated last year
InternScience / EcoClaw
EcoClaw: Save 90%+ on LLM Costs for OpenClaw with One Plugin
☆29 · Apr 3, 2026 · Updated 3 months ago
fscdc / Awesome-Efficient-Reasoning-Models
[TMLR 2025] Efficient Reasoning Models: A Survey
☆314 · Jun 26, 2026 · Updated 3 weeks ago
microsoft / TestExplora
This is an official code for the paper: TestExplora: Benchmarking LLMs for Proactive Bug Discovery via Repository-Level Test Generation
☆27 · Mar 26, 2026 · Updated 3 months ago
whyNLP / PCCoT
Parallel Continuous Chain-of-Thought with Jacobi Iteration. Accepted to EMNLP 2025.
☆23 · Mar 29, 2026 · Updated 3 months ago
staymylove / COT_Compresstion_via_Step_entropy
☆27 · Aug 8, 2025 · Updated 11 months ago
vllm-project / semantic-router
Intelligent Mixture-of-Models Router for Efficient Heterogeneous LLMs Inference
☆5,034 · Updated this week
alibaba / alibaba-lingjun-dataset-2023
☆67 · Jun 25, 2024 · Updated 2 years ago