Harness used to benchmark aider against SWE Bench benchmarks
☆80Jun 27, 2024Updated last year
Alternatives and similar repositories for aider-swe-bench
Users that are interested in aider-swe-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆105Jul 17, 2024Updated last year
- ☆637Sep 1, 2025Updated 7 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆261Mar 29, 2026Updated 2 weeks ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification☆10Oct 4, 2022Updated 3 years ago
- ☆12May 30, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Jul 6, 2023Updated 2 years ago
- A collection of scripts and tools for analyzing SWE agents.☆16May 7, 2025Updated 11 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆2,036Dec 22, 2024Updated last year
- ☆14Nov 3, 2023Updated 2 years ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆62Jan 28, 2025Updated last year
- Codebase, data and models for hallucination of pruned models☆16Jan 11, 2025Updated last year
- The RunBugRun dataset of executable bugs☆24Sep 24, 2025Updated 6 months ago
- ☆19Nov 12, 2025Updated 5 months ago
- A simple app to "unfrack" your Cardano Wallet's UTxO state☆16Jan 27, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- MODIT: On Multi-Modal Learning of Editing Source Code.☆20Apr 24, 2021Updated 4 years ago
- ☆17Sep 1, 2024Updated last year
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.☆12Mar 25, 2026Updated 3 weeks ago
- ☆68May 20, 2025Updated 10 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆75Dec 26, 2024Updated last year
- Notes and insights about OpenAI's Code Interpreter☆13Jul 26, 2023Updated 2 years ago
- AIDE: the Machine Learning CodeGen Agent☆25Oct 7, 2024Updated last year
- Reactive DDD with DSPy☆23Feb 24, 2024Updated 2 years ago
- Advancing LLM with Diverse Coding Capabilities☆79Jul 25, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- DSPy Experiments☆10Aug 28, 2025Updated 7 months ago
- Home-Assistant Custom Component to read data from ryd (formaly Tanktaler)☆10Feb 3, 2024Updated 2 years ago
- SWE-bench: Can Language Models Resolve Real-world Github Issues?☆4,676Apr 1, 2026Updated 2 weeks ago
- ☆13Nov 28, 2025Updated 4 months ago
- SlopCodeBench: Measuring Code Erosion Under Iterative Specification Refinement☆46Apr 5, 2026Updated last week
- ☆28Nov 10, 2025Updated 5 months ago
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar…☆14Mar 25, 2026Updated 3 weeks ago
- One command automated macOS/Linux laptop/VM/container bootstrapper.☆18Apr 8, 2026Updated last week
- VisualChatGPT for googlecolab-version☆22Mar 11, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆18Apr 15, 2024Updated 2 years ago
- Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization (ICML 2024)☆19Apr 6, 2025Updated last year
- ☆132Jun 6, 2025Updated 10 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆134Mar 5, 2026Updated last month
- ☆13Mar 5, 2025Updated last year
- ☆38Apr 8, 2026Updated last week
- A complete version of Uniswap in Plutus.☆12Sep 6, 2021Updated 4 years ago