All-Hands-AI / openhands-resolver
A system that tries to resolve all issues on a GitHub repo with OpenHands.
☆108 Updated 5 months ago
Alternatives and similar repositories for openhands-resolver
Users who are interested in openhands-resolver are comparing it to the libraries listed below.
- Agent computer interface for AI software engineer. ☆73 Updated this week
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more. ☆182 Updated this week
- Harness used to benchmark aider against SWE Bench benchmarks ☆71 Updated 10 months ago
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework. ☆91 Updated 7 months ago
- Code for ScribeAgent paper ☆57 Updated 2 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆77 Updated last month
- ☆50 Updated 5 months ago
- Contains the prompts we use to talk to various LLMs for different utilities inside the editor ☆76 Updated last year
- EcoAssistant: using LLM assistant more affordably and accurately ☆132 Updated 10 months ago
- Scaling Data for SWE-agents ☆160 Updated this week
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 Updated 3 months ago
- ☆125 Updated last month
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆65 Updated 10 months ago
- Lightweight demo using the Anthropic Python SDK to experiment with Claude's Search and Retrieval capabilities over a variety of knowledge… ☆160 Updated 10 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆171 Updated this week
- Just a bunch of benchmark logs for different LLMs ☆119 Updated 9 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆80 Updated 7 months ago
- Interaction-first method for generating demonstrations for web-agents on any website ☆39 Updated 2 weeks ago
- AWM: Agent Workflow Memory ☆270 Updated 3 months ago
- ☆72 Updated 6 months ago
- The Showdown Computer Control Evaluation Suite ☆70 Updated last month
- ☆23 Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆173 Updated 2 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you… ☆70 Updated 4 months ago
- ☆40 Updated 9 months ago
- Verdict is a library for scaling judge-time compute. ☆211 Updated 2 weeks ago
- The first platform designed to empower organizations by automating and enhancing their employment processes through advanced autonomous a… ☆39 Updated 10 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces. ☆35 Updated last week
- Aider's refactoring benchmark exercises based on popular python repos ☆70 Updated 7 months ago
- An agent benchmark with tasks in a simulated software company. ☆350 Updated this week