OpenDevin / OD-SWE-bench
Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem.
☆24Updated 11 months ago
Alternatives and similar repositories for OD-SWE-bench:
Users that are interested in OD-SWE-bench are comparing it to the libraries listed below
- Harness used to benchmark aider against SWE Bench benchmarks☆70Updated 10 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- Agent computer interface for AI software engineer.☆63Updated this week
- Aider's refactoring benchmark exercises based on popular python repos☆68Updated 6 months ago
- 🧠 Mindstorm in Natural Language-based Societies of Mind☆57Updated last month
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆11Updated 2 weeks ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 5 months ago
- ☆155Updated 8 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 10 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆107Updated 5 months ago
- Framework for creating reliable LLM-based conversational agents☆37Updated 3 weeks ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆43Updated last week
- ☆85Updated 2 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆98Updated last year
- LangChain + LiteLLM that works☆39Updated last week
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆93Updated 6 months ago
- Pre-training code for CrystalCoder 7B LLM☆54Updated 11 months ago
- ☆36Updated 4 months ago
- ☆50Updated 5 months ago
- ☆41Updated 4 months ago
- Open Agent Computer Interface☆64Updated 5 months ago
- ☆37Updated 2 years ago
- ☆91Updated 9 months ago
- ☆40Updated 9 months ago
- accompanying material for sleep-time compute paper☆56Updated this week
- ☆119Updated 8 months ago
- Gentopia Agent Zoo and Agent Benchmark☆30Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆122Updated 10 months ago
- AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse A…☆19Updated 3 weeks ago