Codev-Bench (Code Development Benchmark) is a fine-grained, real-world, repository-level, and developer-centric evaluation framework. It assesses whether a code completion tool can accurately capture a developer's immediate intent and suggest appropriate code snippets across diverse, fine-grained contexts.
☆50 · Nov 6, 2024 · Updated last year
Alternatives and similar repositories for Codev-Bench
Users that are interested in Codev-Bench are comparing it to the libraries listed below.
- Inference code of Lingma SWE-GPT ☆255 · Dec 2, 2024 · Updated last year
- ☆25 · Aug 2, 2025 · Updated 8 months ago
- Source code for ISSTA'24 paper "AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation" ☆12 · Oct 21, 2024 · Updated last year
- CodeRepoQA dataset ☆15 · Feb 19, 2025 · Updated last year
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024 ☆196 · Aug 16, 2024 · Updated last year
- LazyGNN: Large-Scale Graph Neural Networks via Lazy Propagation ICML_2023 ☆13 · Oct 27, 2023 · Updated 2 years ago
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment" ☆17 · Feb 26, 2026 · Updated last month
- ☆28 · Nov 10, 2025 · Updated 5 months ago
- A log compression tool (ASE2024) ☆16 · Apr 15, 2025 · Updated 11 months ago
- ☆28 · Oct 2, 2025 · Updated 6 months ago
- Self-Acceleration of CodeLlama for Code Generation ☆47 · Sep 4, 2023 · Updated 2 years ago
- Advancing LLM with Diverse Coding Capabilities ☆80 · Jul 25, 2024 · Updated last year
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆259 · Mar 29, 2026 · Updated last week
- ☆17 · Jul 22, 2024 · Updated last year
- Ling-Coder-Lite is a MoE LLM provided and open-sourced by CodeFuse and InclusionAI. ☆14 · Apr 22, 2025 · Updated 11 months ago
- ☆16 · Mar 16, 2024 · Updated 2 years ago
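- Simulates logging in to the Northeastern University (东北大学) academic affairs office website and retrieves all student information; may no longer work following updates to the site ☆11 · Mar 2, 2019 · Updated 7 years ago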
- ☆80 · Mar 6, 2026 · Updated last month
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023) ☆177 · Aug 15, 2025 · Updated 7 months ago
- Large Language Models Meet NL2Code: A Survey ☆35 · Nov 19, 2024 · Updated last year
- [ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation ☆49 · Jan 28, 2026 · Updated 2 months ago
- ☆63 · Dec 6, 2024 · Updated last year
- ☆13 · Mar 5, 2025 · Updated last year
- ☆17 · Mar 16, 2024 · Updated 2 years ago
- ☆44 · Jun 24, 2025 · Updated 9 months ago
- ☆113 · Jul 17, 2024 · Updated last year
- UQ: Assessing Language Models on Unsolved Questions ☆30 · Aug 26, 2025 · Updated 7 months ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search' ☆27 · Oct 9, 2023 · Updated 2 years ago
- ☆127 · Apr 22, 2023 · Updated 2 years ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification ☆10 · Oct 4, 2022 · Updated 3 years ago
- ☆11 · Jul 25, 2020 · Updated 5 years ago
- A simple example of how to run Pytest on GitHub Actions ☆18 · Dec 10, 2023 · Updated 2 years ago
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025) ☆73 · Jun 25, 2024 · Updated last year
- ☆33 · Jun 5, 2025 · Updated 10 months ago
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving ☆330 · Dec 18, 2025 · Updated 3 months ago
- PsycoLLM, a Chinese large language model for mental health dialogue ☆68 · Aug 22, 2025 · Updated 7 months ago
- Quickly generate shell.nix files once you have a working shell ☆33 · Aug 2, 2025 · Updated 8 months ago
- ☆10 · Oct 22, 2024 · Updated last year
- ☆45 · Aug 31, 2025 · Updated 7 months ago