BorealisAI / code-gen-TAELinks

Code generation from natural language with less prior and more monolingual data

☆13

Alternatives and similar repositories for code-gen-TAE

Users that are interested in code-gen-TAE are comparing it to the libraries listed below

Sorting:

microsoft / ReACC
Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“
☆62Updated 3 years ago
Jun-jie-Huang / CoCLR
Source Code for ACL-21 main conference paper "CoSQA: 20,000+ Web Queries for Code Search and Question Answering".
☆45Updated 2 years ago
rizwan09 / REDCODER
☆45Updated last month
swtheing / WizardCoder_Instruct_Generator
Generate the WizardCoder Instruct from the CodeAlpaca
☆21Updated 2 years ago
wasiahmad / AVATAR
Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.
☆55Updated last year
zorazrw / multilingual-conala
[EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
☆23Updated 2 years ago
amazon-science / recode
Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"
☆52Updated last year
zysszy / TreeGen
A Tree-Based Transformer Architecture for Code Generation. (AAAI'20)
☆91Updated 3 years ago
DeepSoftwareAnalytics / RACE
Replication package for EMNLP2022 paper- RACE: Retrieval-Augmented Commit Message Generation
☆18Updated 2 years ago
17385 / TreeBERT
☆52Updated 3 years ago
ds4an / CoDas4CG
Contests based Dataset for Code Generation
☆13Updated 2 years ago
jadecxliu / CodeQA
Dataset and code for Findings of EMNLP'21 paper "CodeQA: A Question Answering Dataset for Source Code Comprehension".
☆41Updated last year
nchen909 / CodeAttention
CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure, EMNLP 2022
☆13Updated 2 years ago
Alex-HaochenLi / RACS
[EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'
☆27Updated last year
nxphi47 / tree_transformer
Submission to ICLR
☆47Updated 2 years ago
yuewang-cuhk / awesome-programming-language-pretraining-papers
Recent Advances in Programming Language Pre-Trained Models (PL-PTMs)
☆58Updated 3 years ago
guxd / C-DNPG
Data and code for the paper "Continuous Decomposition of Granularity for Neural Paraphrase Generation"
☆8Updated 2 years ago
shangqing-liu / CCSD-benchmark-for-code-summarization
This repo is the benchmark for source code summarization on C language
☆26Updated 4 years ago
zorazrw / odex
[EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation
☆48Updated last year
GXimingLu / neurologic_decoding
☆82Updated 2 years ago
zkcpku / HiT-hierarchy-transformer
code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"
☆12Updated 7 months ago
krystalan / chatgpt_as_nlg_evaluator
Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study
☆43Updated 2 years ago
wasiahmad / PLBART
Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].
☆187Updated 3 years ago
reddy-lab-code-research / MuST-CoST
Code and data for AAAI 2022 paper "Multilingual Code Snippets Training for Program Translation"
☆11Updated 3 years ago
reddy-lab-code-research / PPOCoder
Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
☆114Updated last year
Bolin0215 / CSCGDual
☆16Updated 5 years ago
microsoft / iclr2019-learning-to-represent-edits
Code for the ICLR 2019 paper "Learning to Represent Edits"
☆12Updated 2 years ago
sriniiyer / concode
Mapping Language to Code in a Programmatic Context
☆80Updated 4 years ago
zfj1998 / CodeBert-Code2Text
基于CodeBert预训练模型，微调后/直接对目标数据集进行测试
☆14Updated 3 years ago
hitz-zentroa / lm-contamination
The LM Contamination Index is a manually created database of contamination evidences for LMs.
☆78Updated last year