A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories
☆36 · Sep 4, 2024 · Updated last year
Alternatives and similar repositories for DevEval
Users that are interested in DevEval are comparing it to the libraries listed below.
- ☆27 · Jul 20, 2024 · Updated last year
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories ☆68 · Aug 15, 2024 · Updated last year
- A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI. ☆154 · Dec 25, 2024 · Updated last year
- ☆60 · Jun 19, 2024 · Updated last year
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings) ☆15 · May 14, 2023 · Updated 2 years ago
- ☆14 · May 28, 2024 · Updated last year
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆107 · Mar 6, 2025 · Updated last year
- A Comprehensive Benchmark for Software Development. ☆130 · May 30, 2024 · Updated last year
- ☆32 · Jan 14, 2025 · Updated last year
- The Infibench variant of bigcode-evaluation-harness --- a framework for the evaluation of autoregressive code generation language models. ☆14 · Oct 19, 2024 · Updated last year
- A First Look at Conventional Commits Classification ☆13 · Nov 18, 2024 · Updated last year
- ☆38 · Mar 5, 2026 · Updated 2 weeks ago
- [EMNLP 2023] CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation ☆59 · Nov 16, 2023 · Updated 2 years ago
- ☆14 · Dec 12, 2023 · Updated 2 years ago
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension ☆12 · Oct 23, 2022 · Updated 3 years ago
- Artifact for TOSEM Submission: GiantRepair ☆13 · Jun 26, 2024 · Updated last year
- Official repository of the paper: Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code (Findings of EACL … ☆12 · Feb 11, 2026 · Updated last month
- Chinese generation tasks with MBart, a multilingual denoising pre-trained model ☆11 · May 27, 2021 · Updated 4 years ago
- ☆11 · Jul 14, 2024 · Updated last year
- Include ML DL RL, knowledge and code ☆12 · Feb 12, 2023 · Updated 3 years ago
- Baselines for all tasks from Long Code Arena benchmarks 🏟️ ☆39 · Mar 30, 2025 · Updated 11 months ago
- Code for EMNLP 2021 Paper "Recall and Learn: A Memory-augmented Solver for Math Word Problems". ☆16 · Oct 20, 2022 · Updated 3 years ago
- Code and data for AAAI 2022 paper "Multilingual Code Snippets Training for Program Translation" ☆10 · Mar 7, 2022 · Updated 4 years ago
- A collection of some awesome public projects about LLM-based Web Agents and Tools. ☆12 · Apr 25, 2024 · Updated last year
- GAOGAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models. ☆39 · Jan 7, 2025 · Updated last year
- ☆18 · Mar 18, 2024 · Updated 2 years ago
- SALNet: Semi-supervised Few-Shot Text Classification with Attention-based Lexicon Construction ☆24 · Jun 18, 2022 · Updated 3 years ago
- Counts of ○○女·○○男 ("○○ woman" / "○○ man") labels in the society-section articles of Korean-language newspapers, 1990–2021 ☆17 · Sep 26, 2023 · Updated 2 years ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues ☆13 · Feb 7, 2024 · Updated 2 years ago
- ☆15 · Jul 20, 2025 · Updated 8 months ago
- ☆24 · Nov 19, 2024 · Updated last year
- ☆25 · Nov 25, 2019 · Updated 6 years ago
- ☆23 · Oct 24, 2025 · Updated 4 months ago
- Benchmark ClassEval for class-level code generation. ☆145 · Oct 24, 2024 · Updated last year
- Code for ICML2020 "Sequence Generation with Mixed Representations" ☆12 · Jun 27, 2020 · Updated 5 years ago
- A benchmark for logging statement generation. ☆26 · Nov 3, 2024 · Updated last year
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation". ☆267 · Oct 30, 2024 · Updated last year
- Language and Computers (second semester of the 2021 academic year, Department of Linguistics, Seoul National University) ☆13 · Aug 16, 2022 · Updated 3 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie… ☆49 · Apr 25, 2022 · Updated 3 years ago