[ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.
☆59Jul 21, 2025Updated 8 months ago
Alternatives and similar repositories for SWE-Dev
Users that are interested in SWE-Dev are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation☆49Jan 28, 2026Updated 2 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆116Jan 23, 2025Updated last year
- A pytorch implementation of Abstract Syntax Networks☆12Jun 27, 2025Updated 9 months ago
- Evaluation utilities based on SymPy.☆22Dec 12, 2024Updated last year
- CVE-Factory☆78Mar 27, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- On demand communication☆33Mar 3, 2026Updated last month
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving☆330Dec 18, 2025Updated 3 months ago
- Make Agent CLI is a powerful command-line tool designed to streamline the management and deployment of AI agents across multiple chains. …☆15Sep 3, 2025Updated 7 months ago
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMs☆50Aug 26, 2024Updated last year
- OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection(ICCAD 2024)☆29Oct 20, 2024Updated last year
- [ICML '24] R2E: Turn any GitHub Repository into a Programming Agent Environment☆146Apr 20, 2025Updated 11 months ago
- Reproducing R1 for Code with Reliable Rewards☆302May 5, 2025Updated 11 months ago
- ☆13Mar 5, 2025Updated last year
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning☆15May 13, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- CAN Bus Voltage Dataset for the SIMPLE paper☆11Oct 2, 2019Updated 6 years ago
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆102Aug 25, 2025Updated 7 months ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆169Oct 11, 2024Updated last year
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- ☆41Jun 19, 2024Updated last year
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆25Jun 17, 2025Updated 9 months ago
- ☆10Jan 28, 2024Updated 2 years ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 8 months ago
- Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem.☆29May 26, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction"☆10Feb 16, 2024Updated 2 years ago
- ☆33Jan 10, 2026Updated 3 months ago
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆27Dec 17, 2024Updated last year
- ☆14Jan 8, 2025Updated last year
- Automated Capability Discovery via Foundation Model Self-Exploration☆67Feb 12, 2025Updated last year
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"☆41Sep 24, 2024Updated last year
- ☆11Oct 11, 2023Updated 2 years ago
- Toy implementation of Strawberry☆33Sep 24, 2024Updated last year
- Python powered music controlling webpage with websockets and bottle py (works with spotify, vlc, audacious, and others)☆11Jun 9, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI☆492Jan 3, 2026Updated 3 months ago
- ☆109Jul 15, 2025Updated 8 months ago
- A simple & powerful danmaku framework.☆14Mar 17, 2023Updated 3 years ago
- ☆25Sep 30, 2025Updated 6 months ago
- Code Snippet Recommendation from Stack Overflow Post☆19Jun 30, 2021Updated 4 years ago
- AIDE: the Machine Learning CodeGen Agent☆25Oct 7, 2024Updated last year
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆698Jan 20, 2025Updated last year