Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey". Keep updating.
☆531Mar 16, 2025Updated 11 months ago
Alternatives and similar repositories for Agent4SE-Paper-List
Users that are interested in Agent4SE-Paper-List are comparing it to the libraries listed below
Sorting:
- A continuously updated collection of CodeLLM papers maintained by PurCL group @ Purdue☆606Jan 14, 2026Updated last month
- The First International Workshop on Large Language Model for Code 2024 (Co-Located with ICSE 2024)☆17Oct 4, 2024Updated last year
- [TOSEM 2026]A Systematic Literature Review on Large Language Models for Automated Program Repair☆232Updated this week
- Large Language Models for Software Engineering: A Systematic Literature Review☆104Dec 8, 2025Updated 2 months ago
- [ISSTA 2025] A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing☆13Feb 9, 2025Updated last year
- Evaluation code of ASE24 accepted paper "On the Evaluation of LLM in Unit Test Generation"☆13Dec 9, 2024Updated last year
- Benchmark ClassEval for class-level code generation.☆145Oct 24, 2024Updated last year
- Agentless🐱: an agentless approach to automatically solve software development problems☆2,010Dec 22, 2024Updated last year
- [TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.☆3,242Feb 1, 2026Updated last month
- [SCIS 2025] A Survey on Large Language Models for Software Engineering☆313Feb 6, 2025Updated last year
- Large Language Models for Software Engineering☆259Jul 24, 2025Updated 7 months ago
- A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering☆731Nov 6, 2025Updated 4 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph☆252Apr 1, 2025Updated 11 months ago
- This is an evaluation set for the problem of directed/targeted test input generation. We use it to benchmark the ability of Large Languag…☆34Mar 11, 2025Updated 11 months ago
- ☆105Sep 12, 2024Updated last year
- [TOSEM 2023] A Survey of Learning-based Automated Program Repair☆75May 3, 2024Updated last year
- WhiteFox: White-Box Compiler Fuzzing Empowered by Large Language Models (OOPSLA 2024)☆79Aug 5, 2025Updated 7 months ago
- ☆28Nov 10, 2025Updated 3 months ago
- ☆12Mar 5, 2025Updated last year
- ☆516Jan 10, 2024Updated 2 years ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆678Mar 16, 2025Updated 11 months ago
- [SOSP'25] Automatic checker synthesis for system-level static analysis☆167Oct 26, 2025Updated 4 months ago
- ☆19Jul 26, 2023Updated 2 years ago
- [DL4C @ ICLR 2025] A Benchmark for Automated Environment Setup☆34Nov 9, 2025Updated 3 months ago
- Fast and Precise On-the-fly Patch Validation for All☆10Feb 24, 2023Updated 3 years ago
- Repilot, a patch generation tool introduced in the ESEC/FSE'23 paper "Copiloting the Copilots: Fusing Large Language Models with Completi…☆136Oct 9, 2023Updated 2 years ago
- [ASE2024] Mutual Learning-Based Framework for Enhancing Robustness of Code Models via Adversarial Training☆11Sep 13, 2024Updated last year
- A manually vetted dataset for security vulnerability detection in Java projects☆92Aug 12, 2025Updated 6 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆644Jul 29, 2025Updated 7 months ago
- ☆13Nov 20, 2024Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆174Aug 15, 2025Updated 6 months ago
- ☆628Sep 1, 2025Updated 6 months ago
- Making code edting up to 7.7x faster using multi-layer speculation☆24Feb 20, 2025Updated last year
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!☆165Feb 25, 2026Updated last week
- ☆45Dec 12, 2024Updated last year
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories☆67Aug 15, 2024Updated last year
- EvoEval: Evolving Coding Benchmarks via LLM☆81Apr 6, 2024Updated last year
- Reproducing R1 for Code with Reliable Rewards☆290May 5, 2025Updated 10 months ago
- ☆44Jun 24, 2025Updated 8 months ago