A collection of practical code generation tasks and tests from open source projects. Complementary to HumanEval by OpenAI.
☆24Jan 28, 2023Updated 3 years ago
Alternatives and similar repositories for CoderEval
Users that are interested in CoderEval are comparing it to the libraries listed below
Sorting:
- Commit changes in a more decent way.☆16May 6, 2023Updated 2 years ago
- A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI.☆154Dec 25, 2024Updated last year
- A python library to build graphs for programs written in different programming languages.☆13May 6, 2022Updated 3 years ago
- ACL 2023 Dual-Alignment Pre-training for Cross-lingual Sentence Embedding☆24Aug 21, 2024Updated last year
- NaturalCodeBench (Findings of ACL 2024)☆68Oct 14, 2024Updated last year
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell☆19Oct 16, 2015Updated 10 years ago
- public dataset for followup-query analysis, accepted by AAAI2019☆15Aug 22, 2019Updated 6 years ago
- ☆10Feb 8, 2021Updated 5 years ago
- code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"☆12Dec 13, 2024Updated last year
- ☆56May 28, 2024Updated last year
- BioCoder: A Benchmark for Bioinformatics Code Generation with Large Language Models https://arxiv.org/abs/2308.16458☆57Jul 31, 2025Updated 7 months ago
- Aix-bench, the Java benchmark for code synthesis problem.☆51Aug 19, 2022Updated 3 years ago
- ☆17Feb 14, 2024Updated 2 years ago
- ☆22Mar 25, 2025Updated 11 months ago
- LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案☆17Oct 16, 2023Updated 2 years ago
- Website for Learning from "Big Code"☆30Jun 19, 2021Updated 4 years ago
- Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024)☆66Jun 17, 2025Updated 9 months ago
- A tqdm bar progress that works with MongoDB instead of console.☆11Feb 21, 2022Updated 4 years ago
- Top level project for CAmkES, a component platform that provides support for developing and building static seL4 systems as a collection …☆23Mar 12, 2026Updated last week
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆15May 18, 2022Updated 3 years ago
- JEMMA: An Extensible Java dataset for Many ML4Code Applications☆19Dec 12, 2022Updated 3 years ago
- This repository is the reproduced code of Neural Responding Machine for Short-Text Conversation (https://www.aclweb.org/anthology/P15-115…☆15Dec 20, 2018Updated 7 years ago
- ☆23Oct 30, 2019Updated 6 years ago
- Code for PII detection and redaction in code datasets☆13Jan 24, 2023Updated 3 years ago
- A Survey of Deep Learning Models for Structural Code Understanding☆21May 12, 2022Updated 3 years ago
- A set of tools for extracting tokens and ASTs from code☆22Jun 5, 2018Updated 7 years ago
- The newest version of PatchNet☆14Nov 25, 2022Updated 3 years ago
- ☆43May 9, 2024Updated last year
- Shire Lang Spring/Java Demo project☆18Jan 14, 2025Updated last year
- Comparing Different Stochastic Gradien Descent implementations in Haskell against Python☆10Jul 25, 2016Updated 9 years ago
- Empirical Study of Transformers for Source Code & A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Sourc…☆66Dec 3, 2021Updated 4 years ago
- ☆16Feb 28, 2024Updated 2 years ago
- Code for the curation of The Stack v2 and StarCoder2 training data☆130Apr 11, 2024Updated last year
- Experiments in machine learning on graph databases☆14Feb 6, 2018Updated 8 years ago
- ☆20Aug 9, 2014Updated 11 years ago
- The RunBugRun dataset of executable bugs☆23Sep 24, 2025Updated 5 months ago
- ☆26Nov 26, 2025Updated 3 months ago
- 毕业设计。Keywords: 层次聚类、谱聚类、WordNet☆10Jun 29, 2014Updated 11 years ago
- A simple script for extracting plain text from arxiv dataset: https://www.kaggle.com/Cornell-University/arxiv☆15Dec 7, 2020Updated 5 years ago