[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
β678Mar 16, 2025Updated 11 months ago
Alternatives and similar repositories for swe-rl
Users that are interested in swe-rl are comparing it to the libraries listed below
Sorting:
- Agentlessπ±: an agentless approach to automatically solve software development problemsβ2,010Dec 22, 2024Updated last year
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]β644Jul 29, 2025Updated 7 months ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agentsβ248Jul 13, 2025Updated 7 months ago
- Reproducing R1 for Code with Reliable Rewardsβ290May 5, 2025Updated 10 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agentsβ584Updated this week
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolutionβ104Sep 24, 2025Updated 5 months ago
- Official Repo for Open-Reasoner-Zeroβ2,087Jun 2, 2025Updated 9 months ago
- β28Nov 10, 2025Updated 3 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Predictionβ568May 6, 2025Updated 9 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ261May 5, 2025Updated 10 months ago
- SkyRL: A Modular Full-stack RL Library for LLMsβ1,628Feb 26, 2026Updated last week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β2,522Updated this week
- Democratizing Reinforcement Learning for LLMsβ5,167Updated this week
- MLGym A New Framework and Benchmark for Advancing AI Research Agentsβ585Aug 10, 2025Updated 6 months ago
- β132May 8, 2025Updated 9 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reiβ¦β1,328May 16, 2025Updated 9 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ4,085Nov 13, 2025Updated 3 months ago
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolvingβ323Dec 18, 2025Updated 2 months ago
- verl: Volcano Engine Reinforcement Learning for LLMsβ19,519Updated this week
- Simple RL training for reasoningβ3,830Dec 23, 2025Updated 2 months ago
- [COLM 2025] LIMO: Less is More for Reasoningβ1,064Jul 30, 2025Updated 7 months ago
- Scalable RL solution for advanced reasoning of language modelsβ1,809Mar 18, 2025Updated 11 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"β68Apr 11, 2025Updated 10 months ago
- Sky-T1: Train your own O1 preview model within $450β3,369Jul 12, 2025Updated 7 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.β443Updated this week
- An Open-source RL System from ByteDance Seed and Tsinghua AIRβ1,739May 11, 2025Updated 9 months ago
- πΎ OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.β633Jan 29, 2026Updated last month
- This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Eβ¦β1,439Jul 18, 2025Updated 7 months ago
- β1,392Sep 12, 2025Updated 5 months ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)β9,084Updated this week
- RL Scaling and Test-Time Scaling (ICML'25)β114Jan 23, 2025Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systemsβ125Jun 11, 2025Updated 8 months ago
- SWE-bench: Can Language Models Resolve Real-world Github Issues?β4,385Feb 19, 2026Updated 2 weeks ago
- s1: Simple test-time scalingβ6,636Jun 25, 2025Updated 8 months ago
- Enhancing AI Software Engineering with Repository-level Code Graphβ252Apr 1, 2025Updated 11 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domainsβ50Feb 4, 2026Updated last month
- β628Sep 1, 2025Updated 6 months ago
- β132Jun 6, 2025Updated 8 months ago
- Fully open data curation for reasoning modelsβ2,218Dec 2, 2025Updated 3 months ago