yingweima2022 / SWE-Reasoner
☆25Updated 5 months ago
Alternatives and similar repositories for SWE-Reasoner
Users that are interested in SWE-Reasoner are comparing it to the libraries listed below
- ☆24Updated this week
- This is an implementation for the paper Improve Mathematical Reasoning in Language Models by Automated Process Supervision from google de…☆44Updated 6 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆255Updated 4 months ago
- A research repo for experiments about Reinforcement Finetuning☆53Updated 9 months ago
- Reproducing R1 for Code with Reliable Rewards☆278Updated 8 months ago
- ☆57Updated 7 months ago
- ☆326Updated 7 months ago
- ☆217Updated last week
- ☆299Updated 6 months ago
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories☆67Updated last year
- [COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?☆36Updated 7 months ago
- ☆32Updated 7 months ago
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".☆85Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆55Updated last year
- ☆153Updated 7 months ago
- A comprehensive collection of process reward models.☆131Updated 3 months ago
- ☆70Updated 6 months ago
- Reproducing R1 for Code with Reliable Rewards☆12Updated 9 months ago
- Official Repository of "Learning what reinforcement learning can't"☆75Updated last week
- ☆404Updated 2 months ago
- ☆31Updated 8 months ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆96Updated 2 months ago
- ☆71Updated 8 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆150Updated last year
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆148Updated 2 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆71Updated 9 months ago
- The repository for the ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆33Updated last year
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.☆176Updated 6 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆64Updated last year
- Official code for the paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆61Updated 3 months ago