microsoft / coverage-eval

Dataset with coverage annotations for the HumanEval dataset

☆22

Alternatives and similar repositories for coverage-eval:

Users who are interested in coverage-eval compare it to the repositories listed below

zhangzwwww / DietCode
☆20 Updated 2 years ago
ise-uiuc / xft
XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts
☆30 Updated 8 months ago
amazon-science / recode
Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"
☆51 Updated 11 months ago
EngineeringSoftware / CoditT5
CoditT5: Pretraining for Source Code and Natural Language Editing
☆28 Updated last month
DeepSoftwareAnalytics / CommitMsgEmpirical
☆28 Updated last year
logic-star-ai / swt-bench
[NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation
☆36 Updated this week
YihongDong / CDD-TED4LLMs
☆13 Updated 3 months ago
rizwan09 / REDCODER
☆42 Updated 3 weeks ago
terryyz / DataAug4Code
Source Code Data Augmentation for Deep Learning: A Survey.
☆64 Updated 8 months ago
zysszy / TreeGen-Pytorch
☆17 Updated 2 years ago
evo-eval / evoeval
EvoEval: Evolving Coding Benchmarks via LLM
☆67 Updated 11 months ago
reddy-lab-code-research / XLCoST
Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence
☆68 Updated last month
mahimanzum / FixEval
We introduce FixEval, a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…
☆22 Updated 2 years ago
wasiahmad / AVATAR
Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.
☆53 Updated 7 months ago
bdqnghi / awesome-ai4code
A collection of recent papers, benchmarks and datasets of AI4Code domain.
☆57 Updated 10 months ago
Gompyn / re2com-opensource
code for "Retrieve and Refine: Exemplar-based Neural Comment Generation"
☆15 Updated 3 years ago
gonglinyuan / ast_t5
☆59 Updated 10 months ago
crux-eval / eval-arena
☆21 Updated 4 months ago
reddy-lab-code-research / CodeAttack
Code for the AAAI 2023 paper "CodeAttack: Code-based Adversarial Attacks for Pre-Trained Programming Language Models"
☆28 Updated last year
DeepSoftwareAnalytics / Telly
Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond
☆20 Updated last year
reddy-lab-code-research / StructCoder
Code for "StructCoder: Structure-Aware Transformer for Code Generation"
☆71 Updated last year
justinphan3110 / CoTexT
Code implementation for CoTexT: Multi-task Learning with Code-Text Transformer
☆36 Updated 3 years ago
evalplus / repoqa
RepoQA: Evaluating Long-Context Code Understanding
☆105 Updated 4 months ago
ntunlp / xCodeEval
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
☆78 Updated 5 months ago
amazon-science / llm-code-preference
Training and Benchmarking LLMs for Code Preference.
☆33 Updated 3 months ago
modit-team / MODIT
MODIT: On Multi-Modal Learning of Editing Source Code.
☆20 Updated 3 years ago
microsoft / ReACC
Source code for the paper "ReACC: A Retrieval-Augmented Code Completion Framework"
☆62 Updated 2 years ago
beyondacm / Que2Code
Code Snippet Recommendation from Stack Overflow Post
☆18 Updated 3 years ago
RaoNikitha / CAT-LM
☆14 Updated last year
coinse / libro
Replication package of a paper "Large Language Models are Few-shot Testers: Exploring LLM-based General Bug Reproduction"
☆22 Updated last year