microsoft/FEA-Bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/FEA-Bench)

microsoft / FEA-Bench

[ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation

☆57

Alternatives and similar repositories for FEA-Bench

Users that are interested in FEA-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SWE-Gym / SWE-Bench-Fork
View on GitHub
☆13Mar 5, 2025Updated last year
microsoft / SWE-bench-Live
View on GitHub
[NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!
☆209Jun 11, 2026Updated last month
seketeam / DevEval
View on GitHub
A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories
☆41Sep 4, 2024Updated last year
mariushobbhahn / SWEBench-verified-mini
View on GitHub
☆38Jan 8, 2025Updated last year
YerbaPage / Awesome-Repo-Level-Code-Generation
View on GitHub
Must-read papers on Repository-level Code Generation & Issue Resolution 🔥
☆318Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SWE-rebench / SWE-bench-fork
View on GitHub
Fork to run instances from SWE-rebench
☆30Jun 3, 2026Updated last month
yingweima2022 / CodeLLM
View on GitHub
☆12Jan 31, 2024Updated 2 years ago
THUDM / SWE-Dev
View on GitHub
[ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.
☆62Jul 21, 2025Updated last year
InternLM / SWE-Fixer
View on GitHub
☆139May 8, 2025Updated last year
Leolty / repobench
View on GitHub
✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024
☆212Aug 16, 2024Updated last year
Maxscha / commitbench
View on GitHub
☆12Mar 15, 2024Updated 2 years ago
codefuse-ai / RepoFuse
View on GitHub
☆66Jan 16, 2025Updated last year
DeepSoftwareAnalytics / swe-factory
View on GitHub
[FSE'2026] SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks
☆183May 12, 2026Updated 2 months ago
KDEGroup / CodeRAG
View on GitHub
Source code for EMNLP'25 paper "CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completio…
☆23Apr 15, 2026Updated 3 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
SWE-bench / SWE-smith
View on GitHub
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
☆709Jul 13, 2026Updated last week
SWE-Perf / SWE-Perf
View on GitHub
☆51Oct 28, 2025Updated 8 months ago
JetBrains-Research / lca-baselines
View on GitHub
Baselines for all tasks from Long Code Arena benchmarks 🏟️
☆38Mar 30, 2025Updated last year
zetang94 / ASE2023_kNM-LM
View on GitHub
This is the official implement for the paper 'Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases''
☆14Oct 4, 2023Updated 2 years ago
JesseZZZZZ / RepoZero
View on GitHub
RepoZero: Can LLMs Generate a Code Repository from Scratch? (https://arxiv.org/abs/2605.07122)
☆30Jun 4, 2026Updated last month
SkyworkAI / MindLink
View on GitHub
☆100Aug 8, 2025Updated 11 months ago
waynchi / editbench
View on GitHub
☆31Apr 7, 2026Updated 3 months ago
SWE-Gym / SWE-Gym
View on GitHub
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆708Jul 29, 2025Updated 11 months ago
SalesforceAIResearch / swecomm
View on GitHub
☆28Jun 2, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OpenHands / trajectory-visualizer
View on GitHub
☆47Jun 11, 2026Updated last month
ozyyshr / RepoGraph
View on GitHub
Enhancing AI Software Engineering with Repository-level Code Graph
☆288Apr 1, 2025Updated last year
IBM / TDD-Bench-Verified
View on GitHub
TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)
☆33Jun 18, 2026Updated last month
domaineval / DomainEval
View on GitHub
DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …
☆13Dec 12, 2024Updated last year
multi-swe-bench / MopenHands
View on GitHub
☆17Apr 9, 2025Updated last year
GMago-LeWay / GECFramework
View on GitHub
A Code System for Grammar Error Correction Method. Code Repo for ACL 24 Main "Detection-Correction Structure via General Language Model f…
☆24Sep 17, 2024Updated last year
Hambaobao / SWE-Flow
View on GitHub
SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner
☆39Jun 29, 2025Updated last year
nju-websoft / DraCo
View on GitHub
Dataflow-guided retrieval augmentation for repository-level code completion, ACL 2024 (main)
☆34Mar 24, 2025Updated last year
aorwall / moatless-tree-search
View on GitHub
☆141Jun 6, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
BloodArena / BloodArena
View on GitHub
☆25Apr 26, 2026Updated 2 months ago
zchuz / TimeBench
View on GitHub
The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"
☆36Jun 29, 2024Updated 2 years ago
zsworld6 / projdevbench
View on GitHub
☆23May 7, 2026Updated 2 months ago
XueruiSu / Trust-Region-Preference-Approximation
View on GitHub
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
☆15Jun 28, 2025Updated last year
zhenyuhe00 / SWE-Swiss
View on GitHub
SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution
☆105Sep 24, 2025Updated 9 months ago
RUCAIBox / SWE-World
View on GitHub
☆49Mar 6, 2026Updated 4 months ago
multimodal-art-projection / NL2RepoBench
View on GitHub
☆144May 13, 2026Updated 2 months ago