facebookresearch / swe-rl
Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆509Updated last month
Alternatives and similar repositories for swe-rl:
Users that are interested in swe-rl are comparing it to the libraries listed below
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym☆438Updated 3 weeks ago
- ☆519Updated last week
- AWM: Agent Workflow Memory☆262Updated 2 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆685Updated last week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆182Updated 2 weeks ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆484Updated 2 weeks ago
- CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆507Updated 2 months ago
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning☆714Updated this week
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆363Updated 2 weeks ago
- An agent benchmark with tasks in a simulated software company.☆294Updated 2 weeks ago
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆819Updated 3 weeks ago
- Automatic evals for LLMs☆373Updated this week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆331Updated this week
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…☆305Updated this week
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆440Updated this week
- 🤠 Agent-as-a-Judge and DevAI dataset☆401Updated 3 months ago
- ☆283Updated last month
- Large Reasoning Models☆802Updated 4 months ago
- Code for the paper 🌳 Tree Search for Language Model Agents☆194Updated 9 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆204Updated last month
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e…☆449Updated last month
- Pretraining code for a large-scale depth-recurrent language model☆745Updated last week
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆182Updated this week
- OLMoE: Open Mixture-of-Experts Language Models☆716Updated last month
- ☆922Updated 3 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆882Updated last week
- ☆647Updated 3 weeks ago
- Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆280Updated last month
- Recipes to scale inference-time compute of open models☆1,058Updated 2 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆322Updated this week