SWE-Gym / SWE-Bench-ForkLinks
☆12Updated 10 months ago
Alternatives and similar repositories for SWE-Bench-Fork
Users that are interested in SWE-Bench-Fork are comparing it to the libraries listed below
Sorting:
- ☆56Updated last year
- ☆31Updated last year
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆150Updated 4 months ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆104Updated 4 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Updated last year
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆95Updated 9 months ago
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆27Updated 8 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆120Updated 8 months ago
- NaturalCodeBench (Findings of ACL 2024)☆69Updated last year
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!☆161Updated last week
- ☆32Updated this week
- ☆50Updated 11 months ago
- ☆51Updated last year
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆74Updated last year
- e☆43Updated 9 months ago
- Scaling Long-Horizon LLM Agent via Context-Folding☆101Updated last week
- ☆59Updated 3 weeks ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆64Updated last year
- ☆89Updated 3 months ago
- ☆28Updated 2 months ago
- Training and Benchmarking LLMs for Code Preference.☆37Updated last year
- ☆23Updated last year
- ☆72Updated 7 months ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆21Updated 7 months ago
- ☆53Updated 11 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆50Updated 3 weeks ago
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution☆23Updated 2 months ago
- ☆25Updated 9 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆25Updated 5 months ago
- ☆46Updated 3 months ago