facebookresearch / swe-rl
Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆517Updated 2 months ago
Alternatives and similar repositories for swe-rl
Users that are interested in swe-rl are comparing it to the libraries listed below
Sorting:
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆455Updated last week
- SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning☆261Updated this week
- ☆527Updated last month
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆193Updated last week
- [ICML 2025 Spotlight] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆523Updated last week
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆224Updated this week
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning☆847Updated 2 weeks ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆377Updated 2 weeks ago
- AWM: Agent Workflow Memory☆270Updated 3 months ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆467Updated 2 weeks ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆703Updated last week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆352Updated last week
- Automatic evals for LLMs☆388Updated this week
- ☆181Updated 3 weeks ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆492Updated this week
- ☆691Updated 2 weeks ago
- Atom of Thoughts for Markov LLM Test-Time Scaling☆563Updated this week
- ☆291Updated 2 months ago
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆143Updated last week
- Verifiers for LLM Reinforcement Learning☆953Updated this week
- xLAM: A Family of Large Action Models to Empower AI Agent Systems☆432Updated this week
- Large Reasoning Models☆805Updated 5 months ago
- ⚖️ The First Coding Agent-as-a-Judge☆484Updated last week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆363Updated last month
- LIMO: Less is More for Reasoning☆940Updated last month
- Code for the paper 🌳 Tree Search for Language Model Agents☆199Updated 9 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆193Updated this week
- Seed-Coder is a family of open-source code LLMs comprising base, instruct and reasoning models of 8B size, developed by ByteDance Seed.☆183Updated last week
- Understanding R1-Zero-Like Training: A Critical Perspective☆925Updated last month
- ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates☆383Updated last week