facebookresearch / swe-rl
Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆485Updated 2 weeks ago
Alternatives and similar repositories for swe-rl:
Users that are interested in swe-rl are comparing it to the libraries listed below
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym☆424Updated this week
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆342Updated last month
- ☆499Updated last week
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆401Updated last week
- CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆492Updated last month
- AWM: Agent Workflow Memory☆252Updated 2 months ago
- ☆914Updated 2 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆465Updated last week
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning☆548Updated last week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆306Updated 2 weeks ago
- Large Reasoning Models☆799Updated 4 months ago
- ☆268Updated 2 weeks ago
- ☆596Updated this week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆161Updated this week
- OLMoE: Open Mixture-of-Experts Language Models☆698Updated 3 weeks ago
- 🤠 Agent-as-a-Judge and DevAI dataset☆388Updated 2 months ago
- Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆273Updated 3 weeks ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆173Updated this week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆312Updated 3 months ago
- Recipes to scale inference-time compute of open models☆1,049Updated last month
- ☆1,013Updated 3 months ago
- Automatic evals for LLMs☆352Updated this week
- An agent benchmark with tasks in a simulated software company.☆274Updated 2 weeks ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆669Updated 2 weeks ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆775Updated this week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆657Updated 2 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆340Updated 9 months ago
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e…☆437Updated 3 weeks ago
- A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning mate…☆252Updated last month
- PyTorch building blocks for the OLMo ecosystem☆186Updated this week