Codev-Bench (Code Development Benchmark) is a fine-grained, real-world, repository-level, and developer-centric evaluation framework. It assesses whether a code completion tool can accurately capture a developer's immediate intent and suggest appropriate code snippets across diverse, fine-grained contexts.
☆50 · Nov 6, 2024 · Updated last year
Alternatives and similar repositories for Codev-Bench
Users interested in Codev-Bench are comparing it to the libraries listed below.
- Inference code of Lingma SWE-GPT ☆254 · Dec 2, 2024 · Updated last year
- ☆12 · Jan 31, 2024 · Updated 2 years ago
- Source code for ISSTA'24 paper "AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation" ☆12 · Oct 21, 2024 · Updated last year
- MODIT: On Multi-Modal Learning of Editing Source Code. ☆20 · Apr 24, 2021 · Updated 4 years ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024 ☆192 · Aug 16, 2024 · Updated last year
- ☆28 · Nov 10, 2025 · Updated 4 months ago
- ☆77 · Mar 6, 2026 · Updated 2 weeks ago
- ☆10 · Oct 7, 2024 · Updated last year
- ☆28 · Oct 2, 2025 · Updated 5 months ago
- Self-Acceleration of CodeLlama for Code Generation ☆47 · Sep 4, 2023 · Updated 2 years ago
- Advancing LLM with Diverse Coding Capabilities ☆80 · Jul 25, 2024 · Updated last year
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆255 · Feb 27, 2026 · Updated 3 weeks ago
- ☆159 · Aug 27, 2024 · Updated last year
- Simulates logging in to the Northeastern University (China) academic affairs office website and retrieves all student information; may no longer work after updates to the site. ☆11 · Mar 2, 2019 · Updated 7 years ago
- [ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation ☆47 · Jan 28, 2026 · Updated last month
- CodeDream Online Judge ☆12 · Jan 1, 2023 · Updated 3 years ago
- Large Language Models Meet NL2Code: A Survey ☆35 · Nov 19, 2024 · Updated last year
- ☆63 · Dec 6, 2024 · Updated last year
- ☆13 · Mar 5, 2025 · Updated last year
- ☆113 · Jul 17, 2024 · Updated last year
- UQ: Assessing Language Models on Unsolved Questions ☆30 · Aug 26, 2025 · Updated 6 months ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search' ☆27 · Oct 9, 2023 · Updated 2 years ago
- A DataCastle competition on text keyword extraction. Rank 6 / 622! ☆16 · Jan 27, 2019 · Updated 7 years ago
- ☆127 · Apr 22, 2023 · Updated 2 years ago
- Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024) ☆66 · Jun 17, 2025 · Updated 9 months ago
- ☆11 · Jul 25, 2020 · Updated 5 years ago
- Official implementation of our paper: "RAMBO: Enhancing RAG-based Repository-Level Method Body Completion" ☆15 · Dec 22, 2025 · Updated 2 months ago
- ☆12 · Oct 29, 2022 · Updated 3 years ago
- A simple example of how to run Pytest on GitHub Actions ☆18 · Dec 10, 2023 · Updated 2 years ago
- [ASE'23] When Less is Enough: Positive-Unlabeled Learning Model for Vulnerability Detection ☆16 · Jan 12, 2024 · Updated 2 years ago
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025) ☆73 · Jun 25, 2024 · Updated last year
- ☆10 · Nov 15, 2020 · Updated 5 years ago
- PsycoLLM: a Chinese mental-health dialogue large language model ☆64 · Aug 22, 2025 · Updated 6 months ago
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving ☆326 · Dec 18, 2025 · Updated 3 months ago
- Implementation of the ConvS2S architecture using TensorFlow. Also includes the BiConvS2S for bidirectional sequence-to-sequence generatio… ☆10 · May 14, 2019 · Updated 6 years ago
- https://github.com/PRBonn/kiss-icp ☆11 · Dec 6, 2022 · Updated 3 years ago
- [TOSEM 2026] A Systematic Literature Review on Large Language Models for Automated Program Repair ☆232 · Mar 13, 2026 · Updated last week
- ☆44 · Aug 31, 2025 · Updated 6 months ago
- DataSet and source code for PyART ☆11 · Nov 27, 2022 · Updated 3 years ago