All-Hands-AI / openhands-resolverLinks
A system that tries to resolve all issues on a github repo with OpenHands.
☆117Updated last year
Alternatives and similar repositories for openhands-resolver
Users that are interested in openhands-resolver are comparing it to the libraries listed below
Sorting:
- Agent computer interface for AI software engineer.☆116Updated last month
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆99Updated 4 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆90Updated last month
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated last year
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆424Updated last week
- Harness used to benchmark aider against SWE Bench benchmarks☆79Updated last year
- Contains the prompts we use to talk to various LLMs for different utilities inside the editor☆84Updated 2 years ago
- Open-source resources on agents for computer use.☆398Updated 3 months ago
- Specification for creating reliable LLM-based conversational agents☆65Updated 3 months ago
- DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs.☆185Updated 8 months ago
- Coding problems used in aider's polyglot benchmark☆199Updated last year
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆103Updated 6 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆246Updated last week
- Inference-time scaling for LLMs-as-a-judge.☆327Updated 3 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆189Updated last week
- ☆59Updated last year
- LLM reads a paper and produce a working prototype☆60Updated 9 months ago
- Simple Graph Memory for AI applications☆90Updated 8 months ago
- Sphynx Hallucination Induction☆53Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Updated last year
- Aider's refactoring benchmark exercises based on popular python repos☆78Updated last year
- A better way of testing, inspecting, and analyzing AI Agent traces.☆46Updated 3 weeks ago
- Tutorial for building LLM router☆244Updated last year
- ☆137Updated 10 months ago
- ☆80Updated 4 months ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle☆302Updated last month
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆93Updated last year
- A toolkit for building computer use AI agents☆182Updated 7 months ago
- Function Calling Benchmark & Testing☆92Updated last year
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆60Updated 11 months ago