Aider-AI / polyglot-benchmark
Coding problems used in aider's polyglot benchmark
☆131 Updated 5 months ago
Alternatives and similar repositories for polyglot-benchmark
Users who are interested in polyglot-benchmark are comparing it to the libraries listed below
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more. ☆204 Updated last week
- Aider's refactoring benchmark exercises based on popular python repos ☆73 Updated 7 months ago
- Agent computer interface for AI software engineer. ☆80 Updated this week
- Scaling Data for SWE-agents ☆220 Updated this week
- Harness used to benchmark aider against SWE Bench benchmarks ☆72 Updated 11 months ago
- Contains the prompts we use to talk to various LLMs for different utilities inside the editor ☆78 Updated last year
- ☆271 Updated 2 weeks ago
- proof-of-concept of Cursor's Instant Apply feature ☆81 Updated 9 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆178 Updated last week
- ☆419 Updated this week
- A better way of testing, inspecting, and analyzing AI Agent traces. ☆37 Updated this week
- Enhancing AI Software Engineering with Repository-level Code Graph ☆179 Updated 2 months ago
- A simple Python sandbox for helpful LLM data agents ☆264 Updated 11 months ago
- ☆157 Updated 9 months ago
- The Showdown Computer Control Evaluation Suite ☆73 Updated 2 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆477 Updated 3 weeks ago
- Letting Claude Code develop his own MCP tools :) ☆105 Updated 2 months ago
- A system that tries to resolve all issues on a github repo with OpenHands. ☆109 Updated 6 months ago
- A SQL-like language for efficient code analysis and transformations ☆35 Updated 4 months ago
- Sidecar is the AI brains for the Aide editor and works alongside it, locally on your machine ☆563 Updated 3 weeks ago
- ☆83 Updated last month
- ☆120 Updated 5 months ago
- ☆62 Updated 2 weeks ago
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re… ☆339 Updated this week
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application. ☆119 Updated 2 months ago
- Allows Aider to use CEDARScript as an edit format ☆29 Updated 6 months ago
- Commit0: Library Generation from Scratch ☆149 Updated 3 weeks ago
- A framework for optimizing DSPy programs with RL ☆58 Updated this week
- 🤖 Headless IDE for AI agents ☆189 Updated last month
- Scrapybara Python SDK ☆67 Updated last week