aorwall/moatless-tools

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aorwall/moatless-tools)

aorwall / moatless-tools

☆641

Alternatives and similar repositories for moatless-tools

Users that are interested in moatless-tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aorwall / moatless-tree-search
View on GitHub
☆141Jun 6, 2025Updated last year
OpenAutoCoder / Agentless
View on GitHub
Agentless🐱: an agentless approach to automatically solve software development problems
☆2,085Dec 22, 2024Updated last year
aorwall / SWE-bench-docker
View on GitHub
☆106Jul 17, 2024Updated 2 years ago
NL2Code / CodeR
View on GitHub
☆158Aug 27, 2024Updated last year
SalesforceAIResearch / swecomm
View on GitHub
☆28Jun 2, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Aider-AI / aider-swe-bench
View on GitHub
Harness used to benchmark aider against SWE Bench benchmarks
☆87Jun 27, 2024Updated 2 years ago
ozyyshr / RepoGraph
View on GitHub
Enhancing AI Software Engineering with Repository-level Code Graph
☆289Apr 1, 2025Updated last year
aorwall / moatless-testbeds
View on GitHub
Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…
☆14Apr 9, 2025Updated last year
SWE-Gym / SWE-Bench-Fork
View on GitHub
☆13Mar 5, 2025Updated last year
AutoCodeRoverSG / auto-code-rover
View on GitHub
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…
☆3,096Apr 24, 2025Updated last year
SWE-Gym / SWE-Gym
View on GitHub
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆709Jul 29, 2025Updated 11 months ago
SWE-bench / experiments
View on GitHub
Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.
☆274Mar 29, 2026Updated 3 months ago
facebookresearch / swe-rl
View on GitHub
[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆712Mar 16, 2025Updated last year
Leolty / repobench
View on GitHub
✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024
☆214Aug 16, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
SWE-bench / SWE-bench
View on GitHub
SWE-bench: Can Language Models Resolve Real-world Github Issues?
☆5,482Apr 1, 2026Updated 3 months ago
R2E-Gym / R2E-Gym
View on GitHub
[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
☆309Jul 13, 2025Updated last year
RepoUnderstander / RepoUnderstander
View on GitHub
Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%)
☆97Mar 26, 2025Updated last year
SWE-bench / SWE-smith
View on GitHub
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
☆710Updated this week
SWE-agent / SWE-agent
View on GitHub
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersec…
☆19,905Updated this week
r2e-project / r2e
View on GitHub
[ICML '24] R2E: Turn any GitHub Repository into a Programming Agent Environment
☆149Apr 20, 2025Updated last year
InternLM / SWE-Fixer
View on GitHub
☆139May 8, 2025Updated last year
codestoryai / sidecar
View on GitHub
Sidecar is the AI brains for the Aide editor and works alongside it, locally on your machine
☆604May 14, 2025Updated last year
FloridSleeves / LLMDebugger
View on GitHub
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24)
☆587Sep 10, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
multi-swe-bench / multi-swe-bench
View on GitHub
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
☆354Dec 18, 2025Updated 7 months ago
amazon-science / cceval
View on GitHub
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)
☆181Aug 15, 2025Updated 11 months ago
SWE-agent / SWE-ReX
View on GitHub
Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.
☆555Updated this week
logic-star-ai / swt-bench
View on GitHub
[NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation
☆85Updated this week
xingyaoww / code-act
View on GitHub
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…
☆1,691May 23, 2024Updated 2 years ago
OpenDevin / OD-SWE-bench
View on GitHub
Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem.
☆30May 26, 2024Updated 2 years ago
commit-0 / commit0
View on GitHub
Commit0: Library Generation from Scratch
☆189Feb 24, 2026Updated 5 months ago
nuprl / CanItEdit
View on GitHub
Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions
☆50Sep 13, 2025Updated 10 months ago
mariushobbhahn / SWEBench-verified-mini
View on GitHub
☆38Jan 8, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
FudanSELab / Agent4SE-Paper-List
View on GitHub
Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey". Keep updating.
☆553Mar 16, 2025Updated last year
cslsolow / SWE-Exp
View on GitHub
SWE-Exp: Experience-Driven Software Issue Resolution
☆41Oct 17, 2025Updated 9 months ago
lingxi-agent / Lingxi
View on GitHub
☆256Apr 7, 2026Updated 3 months ago
openai / SWELancer-Benchmark
View on GitHub
This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software E…
☆1,435Jul 18, 2025Updated last year
Aider-AI / refactor-benchmark
View on GitHub
Aider's refactoring benchmark exercises based on popular python repos
☆87Oct 10, 2024Updated last year
ADaM-BJTU / O1-CODER
View on GitHub
AN O1 REPLICATION FOR CODING
☆332Dec 11, 2024Updated last year
JetBrains-Research / EnvBench
View on GitHub
[DL4C @ ICLR 2025] A Benchmark for Automated Environment Setup
☆38Nov 9, 2025Updated 8 months ago