OpenDevin / OD-SWE-bench
Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem.
☆23 Updated 10 months ago
Alternatives and similar repositories for OD-SWE-bench:
Users who are interested in OD-SWE-bench are comparing it to the libraries listed below.
- Harness used to benchmark aider against SWE Bench benchmarks ☆67 Updated 9 months ago
- Agent computer interface for AI software engineer. ☆52 Updated this week
- Aider's refactoring benchmark exercises based on popular Python repos ☆61 Updated 5 months ago
- ☆12 Updated last year
- ☆50 Updated 4 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git… ☆10 Updated last month
- ☆28 Updated 11 months ago
- ☆73 Updated 2 months ago
- ☆39 Updated 8 months ago
- RepoQA: Evaluating Long-Context Code Understanding ☆106 Updated 4 months ago
- ☆155 Updated 7 months ago
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models" ☆74 Updated last year
- Enhancing AI Software Engineering with Repository-level Code Graph ☆149 Updated 2 months ago
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior… ☆11 Updated last year
- ☆87 Updated 8 months ago
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated 10 months ago
- ☆117 Updated 7 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks ☆83 Updated last week
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 Updated 9 months ago
- An open source ChatGPT UI for ToolLlama ☆27 Updated last year
- Gentopia Agent Zoo and Agent Benchmark ☆30 Updated last year
- ☆80 Updated last month
- GPT4 based personalized ArXiv paper assistant bot ☆10 Updated last year
- ☆73 Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities. ☆99 Updated this week
- Challenges for general-purpose web-browsing AI agents ☆45 Updated last month
- ☆20 Updated last year
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap… ☆170 Updated last week
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆122 Updated 9 months ago
- 🧠 Mindstorm in Natural Language-based Societies of Mind ☆55 Updated 2 weeks ago