OpenDevin / OD-SWE-bench
Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem.
☆25 Updated last year
Alternatives and similar repositories for OD-SWE-bench
Users who are interested in OD-SWE-bench are comparing it to the repositories listed below.
- Harness used to benchmark aider against SWE Bench benchmarks ☆72 Updated last year
- Agent computer interface for AI software engineer. ☆88 Updated 2 weeks ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 Updated last year
- 🧠 Societies of Mind & Economy of Minds ☆61 Updated 4 months ago
- LangChain + LiteLLM that works ☆44 Updated last month
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing. ☆40 Updated 3 weeks ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems" ☆54 Updated 4 months ago
- Open Agent Computer Interface ☆75 Updated 7 months ago
- Cerebrum: Agent SDK for AIOS ☆72 Updated last month
- ☆16 Updated 6 months ago
- The first autonomous computer program that can do anything to earn money without human operators. ☆109 Updated 4 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git… ☆13 Updated 3 months ago
- ☆20 Updated last year
- Contains the prompts we use to talk to various LLMs for different utilities inside the editor ☆79 Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆27 Updated 8 months ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆52 Updated this week
- ☆29 Updated last year
- Multi-agent workflows and complex Agent interactions, both via YAML manifest and programmatic usage. Pydantic-AI and LiteLLM backends. Hu… ☆20 Updated last month
- Aider's refactoring benchmark exercises based on popular Python repos ☆75 Updated 9 months ago
- Task management for AI agents ☆15 Updated 2 weeks ago
- Run SWE-bench evaluations remotely ☆27 Updated 2 weeks ago
- ☆62 Updated this week
- Cognition's results and methodology on SWE-bench ☆121 Updated last year
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers ☆29 Updated last month
- CLI that uses DSPy to interact with MCP servers. ☆18 Updated 4 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆49 Updated 9 months ago
- A Python-based chatbot project built on the autogen and tinygrad foundation, utilizing advanced agents for dynamic conversations and func… ☆28 Updated 9 months ago
- Hub for Open Source AGiXT Extensions, Chains, Prompts, and Agents. ☆17 Updated last year
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline. ☆46 Updated this week
- GPT4 based personalized ArXiv paper assistant bot ☆10 Updated last year