A curated list of awesome works in the Routing LLMs paradigm (contributions to this code repository are welcome)
★ 143 · May 24, 2026 · Updated last month
Alternatives and similar repositories for Awesome-Routing-LLMs
Users who are interested in Awesome-Routing-LLMs are comparing it to the libraries listed below.
- A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models · ★ 119 · Jun 3, 2026 · Updated 3 weeks ago
- Repo for EmbedLLM: Learning Compact Representations of Large Language Models · ★ 32 · Sep 25, 2025 · Updated 9 months ago
- A curated list of awesome approaches to AI model routing · ★ 216 · Mar 24, 2025 · Updated last year
- ★ 14 · Nov 19, 2024 · Updated last year
- ★ 127 · Oct 29, 2025 · Updated 8 months ago
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You · ★ 74 · Dec 30, 2025 · Updated 6 months ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". … · ★ 22 · Nov 17, 2025 · Updated 7 months ago
- ★ 33 · Jan 26, 2026 · Updated 5 months ago
- [EMNLP 2024 Main] Official implementation of the paper "To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimoda… · ★ 17 · Dec 13, 2024 · Updated last year
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning · ★ 141 · Dec 30, 2025 · Updated 6 months ago
- The code of RouterDC · ★ 75 · Apr 14, 2025 · Updated last year
- ★ 28 · Updated this week
- ★ 95 · Mar 30, 2026 · Updated 3 months ago
- [ACL 2025 Main] (Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba… · ★ 18 · Aug 15, 2025 · Updated 10 months ago
- Dumpy: A Compact and Adaptive Index for Large Data Series Collections (SIGMOD'23) · ★ 13 · Dec 12, 2023 · Updated 2 years ago
- [CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi… · ★ 49 · Oct 29, 2025 · Updated 8 months ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling · ★ 13 · Mar 7, 2024 · Updated 2 years ago
- ★ 17 · Nov 3, 2024 · Updated last year
- ★ 15 · Jan 24, 2025 · Updated last year
- [ICML 2025] Retraining-Free Merging of Sparse MoE via Hierarchical Clustering · ★ 25 · Oct 26, 2025 · Updated 8 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders · ★ 19 · May 23, 2025 · Updated last year
- Implementation to VirtualTaobao · ★ 13 · Jan 17, 2020 · Updated 6 years ago
- Metadata browser of TREC · ★ 10 · May 19, 2026 · Updated last month
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding" · ★ 16 · Oct 24, 2022 · Updated 3 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022 · ★ 13 · Nov 25, 2022 · Updated 3 years ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models" · ★ 23 · Mar 4, 2025 · Updated last year
- The code for paper "Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry", acc… · ★ 217 · Feb 3, 2026 · Updated 4 months ago
- Efficiency/Effectiveness Trade-offs in Learning to Rank · ★ 12 · Sep 11, 2018 · Updated 7 years ago
- WraAct is a tool to construct the convex hull of various activation functions. · ★ 33 · Jun 24, 2026 · Updated last week
- Pretty collections of tools for educational data mining. · ★ 11 · Aug 1, 2021 · Updated 4 years ago
- AgentIR is a retriever specialized for Deep Research agents. · ★ 59 · Apr 16, 2026 · Updated 2 months ago
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua… · ★ 13 · Nov 11, 2024 · Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings · ★ 24 · Jun 30, 2025 · Updated last year
- Medical Concept Embedding with Multiple Ontological Representations (IJCAI-19) · ★ 10 · Jul 21, 2020 · Updated 5 years ago
- ★ 12 · Mar 6, 2026 · Updated 3 months ago
- RL Recommendation System · ★ 13 · Aug 30, 2019 · Updated 6 years ago
- ★ 32 · Jan 16, 2025 · Updated last year
- [ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory" · ★ 34 · Feb 19, 2025 · Updated last year
- This is the repository of code and dataset for paper "The Rise of Guardians: Fact-checking URL Recommendation to Combat Fake News", SIGIR… · ★ 18 · Feb 19, 2022 · Updated 4 years ago