Harness used to benchmark aider against SWE Bench benchmarks
☆79 Jun 27, 2024 Updated last year
Alternatives and similar repositories for aider-swe-bench
Users that are interested in aider-swe-bench are comparing it to the libraries listed below
- ☆104 Jul 17, 2024 Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 Jun 28, 2024 Updated last year
- Aider's refactoring benchmark exercises based on popular python repos ☆80 Oct 10, 2024 Updated last year
- ☆628 Sep 1, 2025 Updated 6 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆251 Updated this week
- Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem. ☆28 May 26, 2024 Updated last year
- ☆13 Nov 28, 2025 Updated 3 months ago
- Get the best daily repositories ☆10 Updated this week
- Download okCupid users public data automatically ☆10 Feb 6, 2022 Updated 4 years ago
- Enhancing AI Software Engineering with Repository-level Code Graph ☆252 Apr 1, 2025 Updated 11 months ago
- AIDE: the Machine Learning CodeGen Agent ☆25 Oct 7, 2024 Updated last year
- ☆13 Jul 6, 2023 Updated 2 years ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆15 Sep 4, 2024 Updated last year
- Run SWE-bench evaluations remotely ☆58 Aug 14, 2025 Updated 6 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems ☆2,010 Dec 22, 2024 Updated last year
- A simple app to "unfrack" your Cardano Wallet's UTxO state ☆16 Jan 27, 2025 Updated last year
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git… ☆14 Apr 9, 2025 Updated 10 months ago
- A comprehensive list of AI directories ☆23 Nov 26, 2025 Updated 3 months ago
- Work in progress! I don't recommend looking at the code right now. ☆24 Dec 3, 2025 Updated 3 months ago
- Codebase, data and models for hallucination of pruned models ☆16 Jan 11, 2025 Updated last year
- agi from function calls, if you want in vscode ☆18 Dec 18, 2023 Updated 2 years ago
- Contains the prompts we use to talk to various LLMs for different utilities inside the editor ☆84 Jan 24, 2024 Updated 2 years ago
- Get started using Deepgram's Live Transcription with this Flask demo app ☆43 Feb 27, 2026 Updated last week
- ☆132 Jun 6, 2025 Updated 9 months ago
- ☆18 Apr 15, 2024 Updated last year
- ☆19 Aug 1, 2024 Updated last year
- ☆19 Nov 12, 2025 Updated 3 months ago
- ☆16 Apr 26, 2021 Updated 4 years ago
- fork of litellm that is open source ☆22 Jan 22, 2026 Updated last month
- MODIT: On Multi-Modal Learning of Editing Source Code. ☆20 Apr 24, 2021 Updated 4 years ago
- SWE-bench: Can Language Models Resolve Real-world Github Issues? ☆4,385 Feb 19, 2026 Updated 2 weeks ago
- ☆68 May 20, 2025 Updated 9 months ago
- An orchestration system for managing AI coding agents. The system uses Aider (an AI coding assistant) to handle coding tasks and provides… ☆94 Dec 9, 2025 Updated 2 months ago
- This repository demonstrates an application built with Java 21 + Spring Boot 3 + MyBatis, including CRUD operations, authentication, rout… ☆12 Dec 1, 2024 Updated last year
- Agent computer interface for AI software engineer. ☆118 Updated this week
- Agential AI with asynchronous function execution in a Redis queue. ☆27 Jul 24, 2024 Updated last year
- The RunBugRun dataset of executable bugs ☆23 Sep 24, 2025 Updated 5 months ago
- AVATAR: Fixing Semantic Bugs with Fix Patterns of Static Analysis Violations ☆26 Apr 26, 2021 Updated 4 years ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ☆124 Aug 26, 2025 Updated 6 months ago