All-Hands-AI / openhands-resolverLinks
A system that tries to resolve all issues on a github repo with OpenHands.
☆108Updated 7 months ago
Alternatives and similar repositories for openhands-resolver
Users that are interested in openhands-resolver are comparing it to the libraries listed below
Sorting:
- Agent computer interface for AI software engineer.☆85Updated this week
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆228Updated last week
- Harness used to benchmark aider against SWE Bench benchmarks☆72Updated 11 months ago
- ☆50Updated 3 weeks ago
- Scaling Data for SWE-agents☆256Updated this week
- A better way of testing, inspecting, and analyzing AI Agent traces.☆38Updated 3 weeks ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆186Updated this week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆80Updated 3 months ago
- AGI SDK☆60Updated last week
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆98Updated 8 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 5 months ago
- ☆127Updated 3 months ago
- Scale your LLM-as-a-judge.☆240Updated 2 weeks ago
- ☆64Updated last month
- ☆51Updated 3 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆66Updated 11 months ago
- DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs.☆177Updated last month
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆87Updated 3 weeks ago
- Sphynx Hallucination Induction☆54Updated 4 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆104Updated 6 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph☆184Updated 2 months ago
- ☆27Updated 3 weeks ago
- Run SWE-bench evaluations remotely☆21Updated last month
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆82Updated 8 months ago
- ☆41Updated 4 months ago
- ☆89Updated last week
- ⚖️ Awesome LLM Judges ⚖️☆105Updated last month
- Code for ScribeAgent paper☆58Updated 3 months ago
- Open Agent Computer Interface☆73Updated 7 months ago
- ☆97Updated 11 months ago