A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models
☆110Nov 12, 2025Updated 4 months ago
Alternatives and similar repositories for RouterEval
Users that are interested in RouterEval are comparing it to the libraries listed below
Sorting:
- ☆34Dec 19, 2025Updated 3 months ago
- [2025-上海人工智能实验室书生实训营十佳、优秀项目]☆43Sep 22, 2025Updated 6 months ago
- ☆17Apr 11, 2025Updated 11 months ago
- Repo for EmbedLLM: Learning Compact Representations of Large Language Models☆29Sep 25, 2025Updated 5 months ago
- The code of RouterDC☆71Apr 14, 2025Updated 11 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆124Dec 30, 2025Updated 2 months ago
- Efficient LLM query routing via multi-sampling. BEST-Route selects both model and number of responses based on query difficulty, cutting …☆47Aug 6, 2025Updated 7 months ago
- 2023年-全国大学生数学建模竞赛-全国一等奖-A题-定日镜场优化设计模型-代码+论文+答辩PPT☆34Aug 2, 2025Updated 7 months ago
- RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderb…☆71Feb 18, 2026Updated last month
- 🍏专门为 2024 书生·浦语大模型挑战赛 (春季赛) 准备的 Repo🍎收录了赫萝相关的微调源码☆12Sep 20, 2024Updated last year
- Pytorch Implementation of Neural Architecture Search with Reinforcement Learning (in dev)☆20Dec 22, 2019Updated 6 years ago
- Squrve is a lightweight yet powerful framework for translating natural language into SQL over complex databases.☆47Feb 23, 2026Updated 3 weeks ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆16Aug 15, 2025Updated 7 months ago
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You☆62Dec 30, 2025Updated 2 months ago
- Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"☆11Sep 13, 2024Updated last year
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆11Apr 27, 2024Updated last year
- [ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms☆38Jun 4, 2025Updated 9 months ago
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Daily Chinese tech digest from Karpathy’s 90 curated blogs, with AI ranking, link analysis, and a static web reader. | 基于 Karpathy 精选 90 …☆38Feb 19, 2026Updated last month
- Used for thinking process intervention of reasoning models such as DeepSeek-R1, effectively controlling the reasoning thinking process. 用…☆24Apr 14, 2025Updated 11 months ago
- ☆41Jan 4, 2026Updated 2 months ago
- ☆36Feb 12, 2025Updated last year
- 🔥 A curated roadmap to the Efficient VLA landscape. We’re keeping this list live—contribute your latest work!☆92Updated this week
- ☆13Jan 22, 2025Updated last year
- ☆11Dec 31, 2020Updated 5 years ago
- ☆21Feb 28, 2025Updated last year
- ☆11May 19, 2025Updated 10 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 9 months ago
- ☆169Jul 15, 2025Updated 8 months ago
- Official Implementation of "GRIFFIN: Effective Token Alignment for Faster Speculative Decoding"[NeurIPS 2025]☆18May 12, 2025Updated 10 months ago
- Metadata browser of TREC☆10Mar 9, 2026Updated last week
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆25Oct 11, 2025Updated 5 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆24Mar 4, 2025Updated last year
- Clustered Federated Learning via Gradient-based Partitioning (ICML 2024)☆15Jun 15, 2025Updated 9 months ago
- Implementation for "EpiCoder: Encompassing Diversity and Complexity in Code Generation" (ICML 2025)☆27May 16, 2025Updated 10 months ago
- Multicultural Proverbs and Sayings☆13Jan 11, 2025Updated last year
- ☆13Jan 7, 2025Updated last year
- VSS: A Storage System for Video Analytics☆13Jul 9, 2021Updated 4 years ago
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.☆24Oct 19, 2025Updated 5 months ago