Codev-Bench (Code Development Benchmark), a fine-grained, real-world, repository-level, and developer-centric evaluation framework. Codev-Bench assesses whether a code completion tool can accurately capture a developer's immediate intent and suggest appropriate code snippets across diverse, fine-grained contexts.
☆50Nov 6, 2024Updated last year
Alternatives and similar repositories for Codev-Bench
Users that are interested in Codev-Bench are comparing it to the libraries listed below
Sorting:
- Inference code of Lingma SWE-GPT☆253Dec 2, 2024Updated last year
- ☆25Aug 2, 2025Updated 6 months ago
- ☆12Jan 31, 2024Updated 2 years ago
- Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%)☆97Mar 26, 2025Updated 11 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 6 months ago
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"☆17Updated this week
- ☆20Nov 4, 2025Updated 3 months ago
- MODIT: On Multi-Modal Learning of Editing Source Code.☆20Apr 24, 2021Updated 4 years ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆189Aug 16, 2024Updated last year
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- ☆28Nov 10, 2025Updated 3 months ago
- [ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation☆44Jan 28, 2026Updated last month
- ☆112Jul 17, 2024Updated last year
- ☆159Aug 27, 2024Updated last year
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Mar 22, 2024Updated last year
- This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers☆31Jan 13, 2024Updated 2 years ago
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆73Jun 25, 2024Updated last year
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆37Jul 11, 2025Updated 7 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆247Updated this week
- The official repo of continuous speculative decoding☆31Mar 28, 2025Updated 11 months ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- exploring whether LLMs perform case-based or rule-based reasoning☆30Mar 2, 2024Updated last year
- Advancing LLM with Diverse Coding Capabilities☆80Jul 25, 2024Updated last year
- ☆32Jun 5, 2025Updated 8 months ago
- ☆11Dec 23, 2024Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Examples for the HEBI Robotics Python API☆14Jan 9, 2026Updated last month
- ☆36May 25, 2023Updated 2 years ago
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆36Jul 12, 2024Updated last year
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving☆323Dec 18, 2025Updated 2 months ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆174Aug 15, 2025Updated 6 months ago
- ☆44Jun 24, 2025Updated 8 months ago
- Python Inference Script(PyIS)☆19Aug 30, 2022Updated 3 years ago
- Source code for ISSTA'24 paper "AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation"☆12Oct 21, 2024Updated last year
- Models for packages and the resources they contain.☆14Mar 10, 2024Updated last year
- Develop C++/CUDA extensions with PyTorch like Python scripts☆10Jan 7, 2026Updated last month
- An active inference model of Lacanian psychoanalysis☆15Jun 7, 2025Updated 8 months ago
- CANdle - a library for using USB-FDCAN dongle and communicating with md80 drives☆15Sep 15, 2025Updated 5 months ago
- 从头开始学习flutter,记录学习flutter的笔记和代码,一场从Android到Flutter的学习之旅!☆10Aug 11, 2019Updated 6 years ago