facebookresearch / swe-rlLinks

Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"

☆573

Alternatives and similar repositories for swe-rl

Users that are interested in swe-rl are comparing it to the libraries listed below

Sorting:

SWE-Gym / SWE-Gym
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆516Updated last week
SWE-bench / SWE-smith
Scaling Data for SWE-agents
☆328Updated this week
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆679Updated last week
hkust-nlp / CodeIO
[ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
☆537Updated 3 months ago
LiveCodeBench / LiveCodeBench
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
☆608Updated 3 weeks ago
microsoft / rStar
☆608Updated 3 weeks ago
zorazrw / agent-workflow-memory
AWM: Agent Workflow Memory
☆300Updated 6 months ago
facebookresearch / sweet_rl
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
☆233Updated 3 months ago
THUDM / WebRL
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
☆430Updated 2 months ago
BytedTsinghua-SIA / MemAgent
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
☆548Updated this week
mlfoundations / evalchemy
Automatic evals for LLMs
☆496Updated last month
multi-agent-systems-failure-taxonomy / MAST
☆248Updated 2 weeks ago
StonyBrookNLP / appworld
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…
☆232Updated 2 months ago
bigcode-project / bigcodebench
[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI
☆409Updated 3 months ago
ServiceNow / AgentLab
AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…
☆372Updated this week
sunblaze-ucb / Intuitor
Code for the paper: "Learning to Reason without External Rewards"
☆344Updated 3 weeks ago
SalesforceAIResearch / xLAM
xLAM: A Family of Large Action Models to Empower AI Agent Systems
☆513Updated this week
CharlesQ9 / Alita
☆761Updated 2 months ago
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆425Updated last week
NVIDIA / NeMo-Skills
A project to improve skills of large language models
☆501Updated this week
facebookresearch / MLGym
MLGym A New Framework and Benchmark for Advancing AI Research Agents
☆538Updated 2 weeks ago
knoveleng / open-rs
Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"
☆248Updated 2 months ago
ChenxinAn-fdu / POLARIS
Scaling RL on advanced reasoning models
☆543Updated this week
openai / mle-bench
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
☆823Updated last month
QwenLM / ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
☆428Updated 2 months ago
TheAgentCompany / TheAgentCompany
An agent benchmark with tasks in a simulated software company.
☆515Updated last week
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆804Updated 8 months ago
ADaM-BJTU / O1-CODER
AN O1 REPLICATION FOR CODING
☆336Updated 7 months ago
multi-swe-bench / multi-swe-bench
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
☆226Updated last week
SakanaAI / RLT
Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
☆324Updated last month