A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models
☆119Jun 3, 2026Updated 3 weeks ago
Alternatives and similar repositories for RouterEval
Users that are interested in RouterEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)☆143May 24, 2026Updated last month
- [EMNLP 2025 Main] LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQL☆76Jun 18, 2025Updated last year
- ☆127Oct 29, 2025Updated 8 months ago
- [2025-上海人工智能实验室书生实训营十佳、优秀项目]☆43Sep 22, 2025Updated 9 months ago
- [ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory"☆34Feb 19, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The code of RouterDC☆75Apr 14, 2025Updated last year
- Official repo for the paper "Mojito: Motion Trajectory and Intensity Control for Video Generation""☆62May 12, 2026Updated last month
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆141Dec 30, 2025Updated 6 months ago
- multi-agent crafter for cooperative tasks☆14Aug 2, 2025Updated 10 months ago
- Efficient LLM query routing via multi-sampling. BEST-Route selects both model and number of responses based on query difficulty, cutting …☆63Apr 8, 2026Updated 2 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆167Jun 13, 2024Updated 2 years ago
- [CVPR26] Official code for GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristic☆91Mar 24, 2026Updated 3 months ago
- 2022年-深圳杯数学建模挑战赛-全国第二-B题-基于用电可靠性的配电网规划模型-代码+论文+答辩PPT☆20Aug 2, 2025Updated 11 months ago
- RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderb…☆97Jun 23, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 2023年中山大学计算机学院农革老师的计网(计算机网络)实验☆12Mar 1, 2024Updated 2 years ago
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You☆74Dec 30, 2025Updated 6 months ago
- Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"☆11Sep 13, 2024Updated last year
- ☆95Mar 30, 2026Updated 3 months ago
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆12Apr 27, 2024Updated 2 years ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆18Aug 15, 2025Updated 10 months ago
- Implementation of "Multi-modal Retrieval Augmented Multi-modal Generation: Datasets, Evaluation Metrics and Strong Baselines"☆33Feb 24, 2025Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆13Oct 14, 2024Updated last year
- Used for thinking process intervention of reasoning models such as DeepSeek-R1, effectively controlling the reasoning thinking process. 用…☆24Apr 14, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments…☆32Jun 14, 2026Updated 2 weeks ago
- Daily Chinese tech digest from Karpathy’s 90 curated blogs, with AI ranking, link analysis, and a static web reader. | 基于 Karpathy 精选 90 …☆39Feb 19, 2026Updated 4 months ago
- ☆18Aug 19, 2024Updated last year
- ☆37Feb 12, 2025Updated last year
- ☆13Jan 22, 2025Updated last year
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"☆12Dec 8, 2024Updated last year
- ☆16Jan 7, 2025Updated last year
- ☆26Feb 28, 2025Updated last year
- ☆11May 19, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆19May 23, 2025Updated last year
- Official implementation of Vision Transformer Off-the-Shelf: A Surprising Baseline for Few-Shot Class-Agnostic Counting☆24May 3, 2024Updated 2 years ago
- ☆189Jul 15, 2025Updated 11 months ago
- Official Implementation of "GRIFFIN: Effective Token Alignment for Faster Speculative Decoding"[NeurIPS 2025]☆18May 12, 2025Updated last year
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- Clustered Federated Learning via Gradient-based Partitioning (ICML 2024)☆15Jun 15, 2025Updated last year