A curated list of awesome works in the Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)
☆136 · May 24, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for Awesome-Routing-LLMs
Users that are interested in Awesome-Routing-LLMs are comparing it to the libraries listed below.
- A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models ☆115 · Jun 3, 2026 · Updated last week
- [Findings@ACL'26] LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing ☆64 · Apr 6, 2026 · Updated 2 months ago
- ☆14 · Nov 19, 2024 · Updated last year
- ☆126 · Oct 29, 2025 · Updated 7 months ago
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You ☆71 · Dec 30, 2025 · Updated 5 months ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". … ☆23 · Nov 17, 2025 · Updated 6 months ago
- Burstable Cloud Scheduler ☆17 · Jun 6, 2024 · Updated 2 years ago
- RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderb… ☆90 · Jun 4, 2026 · Updated last week
- [EMNLP 2024 Main] Official implementation of the paper "To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimoda… ☆17 · Dec 13, 2024 · Updated last year
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning ☆135 · Dec 30, 2025 · Updated 5 months ago
- The code of RouterDC ☆75 · Apr 14, 2025 · Updated last year
- ⚛ My self website built with react.js ☆26 · Feb 22, 2024 · Updated 2 years ago
- ☆66 · Jun 25, 2024 · Updated last year
- ☆27 · Apr 23, 2026 · Updated last month
- ☆90 · Mar 30, 2026 · Updated 2 months ago
- ☆10 · Feb 21, 2023 · Updated 3 years ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba… ☆18 · Aug 15, 2025 · Updated 9 months ago
- ☆15 · Jan 24, 2025 · Updated last year
- ☆25 · Feb 28, 2025 · Updated last year
- [ICML 2025] Retraining-Free Merging of Sparse MoE via Hierarchical Clustering ☆26 · Oct 26, 2025 · Updated 7 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆19 · May 23, 2025 · Updated last year
- [KDD 2026] Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models? ☆76 · Mar 4, 2026 · Updated 3 months ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025) ☆23 · Jun 2, 2025 · Updated last year
- ☆141 · Aug 11, 2025 · Updated 10 months ago
- Implementation to VirtualTaobao ☆13 · Jan 17, 2020 · Updated 6 years ago
- Metadata browser of TREC ☆10 · May 19, 2026 · Updated 3 weeks ago
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding" ☆16 · Oct 24, 2022 · Updated 3 years ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models" ☆23 · Mar 4, 2025 · Updated last year
- The code for paper "Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry", acc… ☆217 · Feb 3, 2026 · Updated 4 months ago
- An extensive and commented list of resources on Learned Sparse Retrieval. ☆61 · Apr 27, 2026 · Updated last month
- WraAct is a tool to construct the convex hull of various activation functions. ☆33 · Updated this week
- AgentIR is a retriever specialized for Deep Research agents. ☆57 · Apr 16, 2026 · Updated last month
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua… ☆13 · Nov 11, 2024 · Updated last year
- An OpenAI-based simultaneous interpretation application, including a GUI, an Android app, and a command-line interface ☆68 · Dec 11, 2025 · Updated 6 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆165 · Jun 13, 2024 · Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆23 · Jun 30, 2025 · Updated 11 months ago
- Medical Concept Embedding with Multiple Ontological Representations (IJCAI-19) ☆10 · Jul 21, 2020 · Updated 5 years ago
- This repo contains my customised style python based plots for NLP papers, and includes my reproduction for my favourite papers' plots ☆39 · Mar 4, 2024 · Updated 2 years ago
- ☆24 · May 19, 2026 · Updated 3 weeks ago