BorealisAI / code-gen-TAELinks
Code generation from natural language with less prior and more monolingual data
☆13Updated 4 years ago
Alternatives and similar repositories for code-gen-TAE
Users that are interested in code-gen-TAE are comparing it to the libraries listed below
Sorting:
- Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“☆63Updated 3 years ago
- Source Code for ACL-21 main conference paper "CoSQA: 20,000+ Web Queries for Code Search and Question Answering".☆45Updated 2 years ago
- ☆45Updated 2 months ago
- Generate the WizardCoder Instruct from the CodeAlpaca☆21Updated 2 years ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆53Updated last year
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.☆55Updated last year
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages☆23Updated 2 years ago
- code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"☆12Updated 8 months ago
- Code for the ICLR 2019 paper "Learning to Represent Edits"☆12Updated 2 years ago
- Dataset and code for Findings of EMNLP'21 paper "CodeQA: A Question Answering Dataset for Source Code Comprehension".☆41Updated last year
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆15Updated 3 years ago
- 基于CodeBert预训练模型,微调后/直接对目标数据集进行测试☆14Updated 3 years ago
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"☆114Updated last year
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Updated last year
- A Tree-Based Transformer Architecture for Code Generation. (AAAI'20)☆91Updated 3 years ago
- CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure, EMNLP 2022☆13Updated 2 years ago
- Contests based Dataset for Code Generation☆13Updated 2 years ago
- Replication package for EMNLP2022 paper- RACE: Retrieval-Augmented Commit Message Generation☆18Updated 2 years ago
- ☆46Updated 3 years ago
- Submission to ICLR☆47Updated 2 years ago
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond☆21Updated 2 years ago
- This repo is the benchmark for source code summarization on C language☆26Updated 4 years ago
- The CodeInsight dataset is designed for code generation tasks, providing developers with expert-curated examples that bridge the gap betw…☆12Updated 10 months ago
- Code and data for ACL20 paper "Incorporating External Knowledge through Pre-training for Natural Language to Code Generation"☆98Updated 2 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆48Updated last year
- Code for "Learning Structural Edits via Incremental Tree Transformations" (ICLR'21)☆41Updated 4 years ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Updated 2 years ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].☆187Updated 3 years ago
- ☆11Updated 5 years ago
- JEMMA: An Extensible Java dataset for Many ML4Code Applications☆19Updated 2 years ago