[NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
☆116 Dec 30, 2025 Updated last month
Alternatives and similar repositories for Router-R1
Users who are interested in Router-R1 are comparing it to the repositories listed below
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You ☆61 Dec 30, 2025 Updated last month
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling ☆32 Feb 1, 2026 Updated 3 weeks ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents ☆37 Oct 7, 2025 Updated 4 months ago
- documentation used in my projects ☆16 Updated this week
- Generative Modeling with Bayesian Sample Inference ☆24 May 17, 2025 Updated 9 months ago
- ☆19 Mar 3, 2025 Updated 11 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization ☆39 Feb 7, 2026 Updated 3 weeks ago
- ☆17 Aug 5, 2025 Updated 6 months ago
- ☆49 Aug 14, 2025 Updated 6 months ago
- Resa: Transparent Reasoning Models via SAEs ☆47 Sep 23, 2025 Updated 5 months ago
- DCPO: Dynamic Adaptive Clipping for RL ☆45 Dec 20, 2025 Updated 2 months ago
- SING: SDE Inference via Natural Gradients ☆36 Dec 9, 2025 Updated 2 months ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba… ☆15 Aug 15, 2025 Updated 6 months ago
- A simple, interactive web tool to compare pricing and performance metrics of various AI models. ☆16 Feb 20, 2026 Updated last week
- Tutorial for TikZ ☆11 Apr 3, 2025 Updated 10 months ago
- Code for paper "Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs" ☆12 Jun 11, 2025 Updated 8 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆15 Feb 9, 2026 Updated 2 weeks ago
- ☆16 Jul 1, 2025 Updated 7 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models ☆17 Nov 4, 2025 Updated 3 months ago
- Code for paper: Optimizing Length Compression in Large Reasoning Models ☆27 Oct 20, 2025 Updated 4 months ago
- A wrapper around libssh2 for .NET ☆29 Jan 21, 2026 Updated last month
- Personal Finance Expense Tracker ☆19 Nov 14, 2025 Updated 3 months ago
- 🕷️ n8n Community Node for Scrappey API – Automate web scraping and data extraction with advanced anti-bot blocking technology, seamlessl… ☆16 Feb 2, 2026 Updated 3 weeks ago
- A powerful, interactive Python CLI for converting, manipulating, and inspecting media files using FFmpeg 🎬 ☆17 Feb 10, 2026 Updated 2 weeks ago
- ☆16 Jun 10, 2025 Updated 8 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. ☆17 Feb 9, 2026 Updated 2 weeks ago
- Integration into BPM tools (Business Process Management software tools for modeling high-level and detailed processes) and EA (from bus… ☆17 Feb 15, 2026 Updated last week
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs ☆29 Dec 24, 2025 Updated 2 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems" ☆13 Dec 13, 2024 Updated last year
- Camera app drawn on SkiaSharp canvas with real-time SKSL shaders. Built-in desktop shader editor. Made with DrawnUI for .NET MAUI. ☆22 Feb 20, 2026 Updated last week
- Visual image composition helper node for ComfyUI. Grid, diagonals, Phi Grid, Pyramid, Golden Triangles, Perspective lines. Color settings… ☆16 Jul 10, 2025 Updated 7 months ago
- Agent-Driven Software Development Lifecycle (AD-SDLC) system built with Claude Agent SDK ☆28 Updated this week
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates ☆22 Jul 1, 2025 Updated 7 months ago
- [NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking ☆22 Oct 22, 2025 Updated 4 months ago
- Compare Naive Bayes, SVM, XGBoost, Bagging, AdaBoost, K-Nearest Neighbors, Random Forests for classification of Malaria Cells ☆11 Jun 5, 2019 Updated 6 years ago
- This repo documents my workflows and stack to run comfy ui GenANI assist under windows ☆30 Feb 14, 2026 Updated last week
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning" ☆49 Jan 30, 2026 Updated 3 weeks ago
- CLI that syncs Cursor rules into Claude Code’s CLAUDE.md ☆22 Jun 28, 2025 Updated 7 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs ☆59 Updated this week