Leolty/repobench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Leolty/repobench)

Leolty / repobench

✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024

☆212

Alternatives and similar repositories for repobench

Users that are interested in repobench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amazon-science / cceval
View on GitHub
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)
☆181Aug 15, 2025Updated 11 months ago
nju-websoft / DraCo
View on GitHub
Dataflow-guided retrieval augmentation for repository-level code completion, ACL 2024 (main)
☆34Mar 24, 2025Updated last year
amazon-science / cocomic
View on GitHub
CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context
☆19Feb 20, 2026Updated 5 months ago
Dianshu-Liao / AAA-Code-Generation-Framework-for-Code-Repository-Local-Aware-Global-Aware-Third-Party-Aware
View on GitHub
☆26Dec 16, 2023Updated 2 years ago
DeepSoftwareAnalytics / RLCoder
View on GitHub
Reinforcement Learning for Repository-Level Code Completion
☆43Jun 15, 2026Updated last month
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
amazon-science / mxeval
View on GitHub
☆113Jul 17, 2024Updated 2 years ago
FudanSELab / ClassEval
View on GitHub
Benchmark ClassEval for class-level code generation.
☆151Oct 24, 2024Updated last year
SWE-Gym / SWE-Gym
View on GitHub
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆708Jul 29, 2025Updated 11 months ago
nuprl / CanItEdit
View on GitHub
Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions
☆50Sep 13, 2025Updated 10 months ago
qishenghu / InstructCoder
View on GitHub
InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw
☆66Oct 4, 2024Updated last year
JetBrains-Research / lca-baselines
View on GitHub
Baselines for all tasks from Long Code Arena benchmarks 🏟️
☆38Mar 30, 2025Updated last year
microsoft / CodeT
View on GitHub
☆678Nov 1, 2024Updated last year
allanj / repo-level-codegen-papers
View on GitHub
Repo-Level Code generation papers
☆235Dec 16, 2025Updated 7 months ago
gonglinyuan / safim
View on GitHub
☆49May 6, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / cruxeval
View on GitHub
CRUXEval: Code Reasoning, Understanding, and Execution Evaluation
☆171Oct 11, 2024Updated last year
microsoft / FEA-Bench
View on GitHub
[ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation
☆57Jan 28, 2026Updated 5 months ago
shrivastavadisha / repo_level_prompt_generation
View on GitHub
☆127Apr 22, 2023Updated 3 years ago
Aider-AI / aider-swe-bench
View on GitHub
Harness used to benchmark aider against SWE Bench benchmarks
☆87Jun 27, 2024Updated 2 years ago
SalesforceAIResearch / swecomm
View on GitHub
☆28Jun 2, 2026Updated last month
zetang94 / ASE2023_kNM-LM
View on GitHub
This is the official implement for the paper 'Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases''
☆14Oct 4, 2023Updated 2 years ago
nuprl / MultiPL-E
View on GitHub
A multi-programming language benchmark for LLMs
☆312Apr 12, 2026Updated 3 months ago
openai / human-eval-infilling
View on GitHub
Code for the paper "Efficient Training of Language Models to Fill in the Middle"
☆208Apr 2, 2023Updated 3 years ago
evalplus / evalplus
View on GitHub
Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
☆1,781Oct 2, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
InternLM / SWE-Fixer
View on GitHub
☆139May 8, 2025Updated last year
seketeam / EvoCodeBench
View on GitHub
An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories
☆71Aug 15, 2024Updated last year
bigcode-project / bigcode-evaluation-harness
View on GitHub
A framework for the evaluation of autoregressive code generation language models.
☆1,052Jul 22, 2025Updated last year
kwaipilot / SWE-Compass
View on GitHub
☆18Mar 28, 2026Updated 3 months ago
codefuse-ai / Awesome-Code-LLM
View on GitHub
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
☆3,412May 20, 2026Updated 2 months ago
bytedance / FullStackBench
View on GitHub
Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"
☆122May 7, 2025Updated last year
aorwall / moatless-tools
View on GitHub
☆641Sep 1, 2025Updated 10 months ago
CoderEval / CoderEval
View on GitHub
A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI.
☆158Dec 25, 2024Updated last year
FSoft-AI4Code / RepoHyper
View on GitHub
[FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository
☆74Sep 3, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
microsoft / ReACC
View on GitHub
Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“
☆67Apr 18, 2022Updated 4 years ago
SWE-Gym / SWE-Bench-Fork
View on GitHub
☆13Mar 5, 2025Updated last year
bigcode-project / selfcodealign
View on GitHub
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
☆323Feb 24, 2025Updated last year
SalesforceAIResearch / CodeChain
View on GitHub
Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"
☆49Jun 2, 2026Updated last month
Hambaobao / SWE-Flow
View on GitHub
SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner
☆40Jun 29, 2025Updated last year
LingmaTongyi / Codev-Bench
View on GitHub
Codev-Bench (Code Development Benchmark), a fine-grained, real-world, repository-level, and developer-centric evaluation framework. Codev…
☆48Nov 6, 2024Updated last year
open-compass / DevEval
View on GitHub
A Comprehensive Benchmark for Software Development.
☆135May 30, 2024Updated 2 years ago