OpenDevin / OD-SWE-bench

Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem.

☆25

Alternatives and similar repositories for OD-SWE-bench

Users who are interested in OD-SWE-bench are comparing it to the repositories listed below.

Aider-AI / aider-swe-bench
Harness used to benchmark aider against SWE Bench benchmarks
☆72 Updated last year
All-Hands-AI / openhands-aci
Agent computer interface for AI software engineer.
☆88 Updated 2 weeks ago
codestoryai / swe_bench_traces
Contains the model patches and the eval logs from the passing swe-bench-lite run.
☆10 Updated last year
metauto-ai / NLSOM
🧠 Societies of Mind & Economy of Minds
☆61 Updated 4 months ago
stanford-oval / chainlite
LangChain + LiteLLM that works
☆44 Updated last month
The-Swarm-Corporation / swarms-cloud
Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.
☆40 Updated 3 weeks ago
sony / talkhier
Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"
☆54 Updated 4 months ago
simular-ai / OpenACI
Open Agent Computer Interface
☆75 Updated 7 months ago
agiresearch / Cerebrum
Cerebrum: Agent SDK for AIOS
☆72 Updated last month
OSU-NLP-Group / SeeActChromeExtension
☆16 Updated 6 months ago
James4Ever0 / agi_computer_control
The first autonomous computer program that can do anything to earn money without human operators.
☆109 Updated 4 months ago
aorwall / moatless-testbeds
Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…
☆13 Updated 3 months ago
OS-Copilot / FRIDAY-front
☆20 Updated last year
codestoryai / prompts
Contains the prompts we use to talk to various LLMs for different utilities inside the editor
☆79 Updated last year
kyegomez / Exa
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…
☆27 Updated 8 months ago
agential-ai / agential
🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!
☆52 Updated this week
OS-Copilot / FRIDAY-Gizmos
☆29 Updated last year
phil65 / llmling-agent
Multi-agent workflows and complex Agent interactions, both via YAML manifest and programmatic usage. Pydantic-AI and LiteLLM backends. Hu…
☆20 Updated last month
Aider-AI / refactor-benchmark
Aider's refactoring benchmark exercises based on popular python repos
☆75 Updated 9 months ago
agentsea / taskara
Task management for AI agents
☆15 Updated 2 weeks ago
SWE-bench / sb-cli
Run SWE-bench evaluations remotely
☆27 Updated 2 weeks ago
lm-sys / lm-sys.github.io
☆62 Updated this week
CognitionAI / devin-swebench-results
Cognition's results and methodology on SWE-bench
☆121 Updated last year
PrimeIntellect-ai / prime-cli
The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers
☆29 Updated last month
shane-kercheval / mcp-client-agent
CLI that uses DSPy to interact with MCP servers.
☆18 Updated 4 months ago
catena-labs / moa-llm
A Python library to orchestrate LLMs in a neural network-inspired structure
☆49 Updated 9 months ago
shoutsid / townhall
A Python-based chatbot project built on the autogen and tinygrad foundation, utilizing advanced agents for dynamic conversations and func…
☆28 Updated 9 months ago
AGiXT / hub
Hub for Open Source AGiXT Extensions, Chains, Prompts, and Agents.
☆17 Updated last year
THUDM / SWE-Dev
[ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.
☆46 Updated this week
stanford-oval / gpt_paper_assistant
GPT4 based personalized ArXiv paper assistant bot
☆10 Updated last year