Official code space for "SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development"
☆61Oct 24, 2025Updated 6 months ago
Alternatives and similar repositories for SWE-Dev
Users that are interested in SWE-Dev are comparing it to the libraries listed below.
- A general framework for evaluating the performance of large language models (LLMs) based on a peer-review mechanism among LLMs☆19Aug 3, 2024Updated last year
- The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domain☆43Apr 10, 2026Updated last month
- ☆22May 3, 2025Updated last year
- 🧬 Python code that implements the active finite Voronoi (AFV) model.☆21May 2, 2026Updated last week
- ☆50Oct 28, 2025Updated 6 months ago
- [ACL'25 Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.☆59Jul 21, 2025Updated 9 months ago
- SimKO: Simple Pass@K Policy Optimization☆31Oct 24, 2025Updated 6 months ago
- ☆13Mar 5, 2025Updated last year
- ☆16Jul 26, 2023Updated 2 years ago
- Cosmos-Transfer1-7B-Sample-AV Toolkits☆46Jun 11, 2025Updated 10 months ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- ☆12Jan 31, 2024Updated 2 years ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 3 years ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification☆10Oct 4, 2022Updated 3 years ago
- Structured workflows for AI agents. Guides Claude Code, Cursor, Codex, and other AI assistants through deliberate software development wi…☆27Oct 8, 2025Updated 7 months ago
- ☆128Apr 29, 2026Updated last week
- ☆13Feb 10, 2021Updated 5 years ago
- ☆234Jul 25, 2025Updated 9 months ago
- Code for paper https://arxiv.org/abs/2501.00522☆15Apr 28, 2025Updated last year
- Golang SDK for TRON blockchain☆19Mar 18, 2025Updated last year
- A Benchmark for Multi-Stage Legal Case Documents Generation☆17Feb 24, 2025Updated last year
- ☆14Apr 1, 2024Updated 2 years ago
- The Agentic Developer Environment to orchestrate Claude Code, Codex, Copilot, Cursor, Opencode.☆73May 1, 2026Updated last week
- ☆41Jul 21, 2024Updated last year
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆72Apr 2, 2025Updated last year
- Empirical quantitative marketing models for PhD level marketing students☆35May 12, 2017Updated 8 years ago
- Code to go along with my AI agents YouTube video☆17Apr 5, 2024Updated 2 years ago
- ☆26Mar 4, 2026Updated 2 months ago
- Course project. An implementation of Graph Wavelet Neural Network (ICLR 2019)☆11Jan 6, 2020Updated 6 years ago
- [ICLR2023] Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning (https://arxiv.org/abs/2210.0022…☆39Jan 30, 2023Updated 3 years ago
- RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for paper "Token-wise Influential Training Da…☆21Mar 10, 2026Updated last month
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Jul 19, 2025Updated 9 months ago
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆25Apr 13, 2026Updated 3 weeks ago
- Nonlinear Granger causality inference with neural networks for high-resolution mass spectrometry☆14Oct 31, 2021Updated 4 years ago
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆29Aug 18, 2025Updated 8 months ago
- ☆10Apr 17, 2024Updated 2 years ago
- Using KAG and RAG Approaches to Enhance an AI-Powered Cryptocurrency Trading Agent☆28Jan 19, 2025Updated last year
- Example dialogs to get your creative juices flowing☆42Mar 31, 2026Updated last month
- MATCH-TUNING☆15Aug 6, 2022Updated 3 years ago