Harness used to benchmark aider against SWE Bench benchmarks
☆81 · Jun 27, 2024 · Updated last year
Alternatives and similar repositories for aider-swe-bench
Users that are interested in aider-swe-bench are comparing it to the libraries listed below.
- ☆105 · Jul 17, 2024 · Updated last year
- ☆637 · Sep 1, 2025 · Updated 8 months ago
- Aider's refactoring benchmark exercises based on popular Python repos ☆82 · Oct 10, 2024 · Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 · Jun 28, 2024 · Updated last year
- Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem. ☆30 · May 26, 2024 · Updated last year
- ESEC/FSE'21: Prediction-Preserving Program Simplification ☆10 · Oct 4, 2022 · Updated 3 years ago
- ☆12 · May 30, 2025 · Updated 11 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph ☆273 · Apr 1, 2025 · Updated last year
- A collection of scripts and tools for analyzing SWE agents. ☆16 · May 7, 2025 · Updated last year
- Agentless🐱: an agentless approach to automatically solve software development problems ☆2,042 · Dec 22, 2024 · Updated last year
- Run SWE-bench evaluations remotely ☆63 · Aug 14, 2025 · Updated 8 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆62 · Jan 28, 2025 · Updated last year
- The RunBugRun dataset of executable bugs ☆24 · Sep 24, 2025 · Updated 7 months ago
- ☆19 · Nov 12, 2025 · Updated 5 months ago
- MODIT: On Multi-Modal Learning of Editing Source Code. ☆20 · Apr 24, 2021 · Updated 5 years ago
- ☆17 · Sep 1, 2024 · Updated last year
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization. ☆12 · Mar 25, 2026 · Updated last month
- ☆67 · May 20, 2025 · Updated 11 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025). ☆75 · Dec 26, 2024 · Updated last year
- AIDE: the Machine Learning CodeGen Agent ☆25 · Oct 7, 2024 · Updated last year
- Reactive DDD with DSPy ☆23 · Feb 24, 2024 · Updated 2 years ago
- DSPy Experiments ☆10 · Aug 28, 2025 · Updated 8 months ago
- SWE-bench: Can Language Models Resolve Real-world Github Issues? ☆4,831 · Apr 1, 2026 · Updated last month
- ☆13 · Nov 28, 2025 · Updated 5 months ago
- AVATAR: Fixing Semantic Bugs with Fix Patterns of Static Analysis Violations ☆26 · Apr 26, 2021 · Updated 5 years ago
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar… ☆14 · Mar 25, 2026 · Updated last month
- One command automated macOS/Linux laptop/VM/container bootstrapper. ☆18 · Apr 29, 2026 · Updated last week
- VisualChatGPT for googlecolab-version ☆22 · Mar 11, 2023 · Updated 3 years ago
- ☆18 · Apr 15, 2024 · Updated 2 years ago
- virtual node analysis on ogb benchmark dataset ☆14 · Mar 9, 2023 · Updated 3 years ago
- Landing page + leaderboard for SWE-Bench benchmark ☆12 · Mar 29, 2026 · Updated last month
- ☆134 · Jun 6, 2025 · Updated 11 months ago
- METR Task Standard ☆179 · Feb 3, 2025 · Updated last year
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ☆136 · Apr 29, 2026 · Updated last week
- Replication package of a paper "Large Language Models are Few-shot Testers: Exploring LLM-based General Bug Reproduction" ☆28 · Sep 7, 2023 · Updated 2 years ago
- ☆38 · Apr 8, 2026 · Updated 3 weeks ago
- The StreamingGradioCallbackHandler is a custom callback handler that works with Language Models (LLMs) that support streaming. It facilit… ☆10 · Oct 21, 2023 · Updated 2 years ago
- Agent fixing SWE bench issues ☆19 · May 21, 2024 · Updated last year
- The source code for paper--MORE: A Metric learning based framework for Open-domain Relation Extraction. ☆12 · Jan 15, 2021 · Updated 5 years ago