☆132 · Jun 6, 2025 · Updated 9 months ago
Alternatives and similar repositories for moatless-tree-search
Users who are interested in moatless-tree-search compare it to the libraries listed below.
- ☆12 · Nov 5, 2024 · Updated last year
- ☆628 · Sep 1, 2025 · Updated 6 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git… ☆14 · Apr 9, 2025 · Updated 10 months ago
- Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%) ☆97 · Mar 26, 2025 · Updated 11 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph ☆252 · Apr 1, 2025 · Updated 11 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus) ☆19 · Feb 11, 2025 · Updated last year
- ☆132 · May 8, 2025 · Updated 9 months ago
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution ☆25 · Nov 11, 2025 · Updated 3 months ago
- SWE-Exp: Experience-Driven Software Issue Resolution ☆35 · Oct 17, 2025 · Updated 4 months ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation ☆71 · Jan 15, 2026 · Updated last month
- Agent computer interface for AI software engineer. ☆118 · Feb 27, 2026 · Updated last week
- Agentless🐱: an agentless approach to automatically solve software development problems ☆2,010 · Dec 22, 2024 · Updated last year
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆251 · Feb 27, 2026 · Updated last week
- Landing page + leaderboard for SWE-Bench benchmark ☆11 · Feb 26, 2026 · Updated last week
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark. ☆29 · Apr 21, 2025 · Updated 10 months ago
- ☆47 · Oct 28, 2025 · Updated 4 months ago
- ☆65 · Jan 16, 2025 · Updated last year
- Inference code of Lingma SWE-GPT ☆255 · Dec 2, 2024 · Updated last year
- Commit0: Library Generation from Scratch ☆187 · Feb 24, 2026 · Updated last week
- ☆104 · Jul 17, 2024 · Updated last year
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆678 · Mar 16, 2025 · Updated 11 months ago
- Harness used to benchmark aider against SWE Bench benchmarks ☆79 · Jun 27, 2024 · Updated last year
- A Comprehensive Benchmark for Software Development. ☆127 · May 30, 2024 · Updated last year
- Must-read papers on Repository-level Code Generation & Issue Resolution 🔥 ☆259 · Dec 22, 2025 · Updated 2 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆584 · Updated this week
- ☆159 · Aug 27, 2024 · Updated last year
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more. ☆443 · Feb 27, 2026 · Updated last week
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models ☆31 · Apr 1, 2025 · Updated 11 months ago
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository ☆72 · Sep 3, 2024 · Updated last year
- Shaping Language Models with Cognitive Insights ☆15 · Feb 29, 2024 · Updated 2 years ago
- ☆12 · Mar 5, 2025 · Updated last year
- ☆24 · Oct 3, 2025 · Updated 5 months ago
- ☆11 · Jan 3, 2024 · Updated 2 years ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation" ☆10 · Mar 8, 2024 · Updated last year
- A Model Context Protocol (MCP) server that provides persistent memory and multi-model LLM support. ☆27 · Jan 3, 2025 · Updated last year
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving ☆323 · Dec 18, 2025 · Updated 2 months ago
- A Model Context Protocol server for Python code analysis with Claude. Again, works with warning now. I'm missing something here. ☆12 · Nov 29, 2025 · Updated 3 months ago
- Open source static analysis toolkit for LLM agent plans ☆13 · Aug 9, 2025 · Updated 6 months ago
- ☆34 · Jan 25, 2026 · Updated last month