SWE-agent / SWE-ReX
Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.
☆309 Updated last week
Alternatives and similar repositories for SWE-ReX
Users who are interested in SWE-ReX are comparing it to the libraries listed below.
- Scaling Data for SWE-agents ☆399 Updated this week
- ☆593 Updated 2 weeks ago
- Agent computer interface for AI software engineer. ☆109 Updated last week
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆210 Updated this week
- Coding problems used in aider's polyglot benchmark ☆179 Updated 8 months ago
- A benchmark for LLMs on complicated tasks in the terminal ☆691 Updated this week
- ☆111 Updated 3 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph ☆213 Updated 5 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆541 Updated last month
- ☆53 Updated 7 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆182 Updated 6 months ago
- ☆99 Updated last year
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆595 Updated 6 months ago
- An agent benchmark with tasks in a simulated software company. ☆546 Updated 3 weeks ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle ☆295 Updated last week
- Open-source resources on agents for computer use. ☆369 Updated 7 months ago
- A system that tries to resolve all issues on a github repo with OpenHands. ☆113 Updated 9 months ago
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re… ☆404 Updated this week
- Harness used to benchmark aider against SWE Bench benchmarks ☆75 Updated last year
- Inference-time scaling for LLMs-as-a-judge. ☆293 Updated 2 weeks ago
- ☆133 Updated 5 months ago
- AWM: Agent Workflow Memory ☆316 Updated 7 months ago
- A framework for optimizing DSPy programs with RL ☆172 Updated this week
- Commit0: Library Generation from Scratch ☆162 Updated 4 months ago
- ☆159 Updated last year
- Tutorial for building LLM router ☆226 Updated last year
- Beating the GAIA benchmark with Transformers Agents. 🚀 ☆135 Updated 6 months ago
- r2e: turn any github repository into a programming agent environment ☆129 Updated 4 months ago
- A Text-Based Environment for Interactive Debugging ☆262 Updated this week
- ☆204 Updated last year