LightChen233 / reasoning-boundary
☆70Jun 18, 2025Updated 7 months ago
Alternatives and similar repositories for reasoning-boundary
Users who are interested in reasoning-boundary are comparing it to the repositories listed below.
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 2 months ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆42Sep 18, 2025Updated 4 months ago
- Bayes-Adaptive RL for LLM Reasoning☆45May 28, 2025Updated 8 months ago
- ☆20Nov 3, 2024Updated last year
- ☆31Sep 12, 2025Updated 5 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆53Sep 29, 2025Updated 4 months ago
- ☆88Jun 7, 2024Updated last year
- ☆24Aug 19, 2025Updated 5 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆107Mar 6, 2025Updated 11 months ago
- ☆29Nov 9, 2025Updated 3 months ago
- The official implementation of the ECCV'24 paper MC-CoT: Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models w…☆26May 19, 2024Updated last year
- ☆46Jun 24, 2025Updated 7 months ago
- ☆49Aug 14, 2025Updated 5 months ago
- [NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs☆49Nov 29, 2024Updated last year
- ☆16Sep 4, 2025Updated 5 months ago
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 6 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated this week
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated last year
- ☆12Mar 5, 2025Updated 11 months ago
- Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning☆12Sep 2, 2024Updated last year
- Automating Sub-Agent Creation for Agentic Orchestration☆30Updated this week
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- ☆32Jan 25, 2026Updated 2 weeks ago
- ☆13Sep 12, 2024Updated last year
- LCA-on-the-line (ICML 2024 Oral)☆13Feb 13, 2025Updated last year
- Exploring the Limitations of Large Language Models on Multi-Hop Queries☆32Mar 2, 2025Updated 11 months ago
- ☆56Aug 10, 2024Updated last year
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆51Nov 9, 2024Updated last year
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated last year
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆83Jan 14, 2025Updated last year
- ☆145Sep 12, 2025Updated 5 months ago
- ☆53Feb 11, 2025Updated last year
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆97Feb 21, 2025Updated 11 months ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆26Oct 15, 2025Updated 3 months ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆23Dec 14, 2025Updated last month
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 8 months ago
- ☆26Jun 5, 2025Updated 8 months ago
- ☆11Sep 7, 2024Updated last year
- ☆13Dec 9, 2024Updated last year