Code for generating the JuICe dataset.
☆37 · Oct 27, 2021 · Updated 4 years ago
Alternatives and similar repositories for JuICe
Users who are interested in JuICe are comparing it to the repositories listed below.
- ☆21 · Oct 6, 2021 · Updated 4 years ago
- Django Dataset for Code Translation Tasks ☆31 · Feb 21, 2018 · Updated 8 years ago
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022]. ☆18 · Mar 28, 2022 · Updated 3 years ago
- JEMMA: An Extensible Java dataset for Many ML4Code Applications ☆19 · Dec 12, 2022 · Updated 3 years ago
- Code for "CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning" (WWW 2019) ☆37 · Apr 21, 2020 · Updated 5 years ago
- Code and data for ACL20 paper "Incorporating External Knowledge through Pre-training for Natural Language to Code Generation" ☆97 · Sep 22, 2025 · Updated 5 months ago
- ☆10 · Jul 27, 2020 · Updated 5 years ago
- Code and data for automatic paraphrase dataset augmentation. ☆11 · Mar 8, 2021 · Updated 4 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024 ☆13 · Apr 21, 2024 · Updated last year
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency. ☆12 · Oct 12, 2024 · Updated last year
- Mapping Language to Code in a Programmatic Context ☆80 · Jan 27, 2021 · Updated 5 years ago
- PyTorch implementation for "ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback" (https://arxiv.org/abs/2107.14035). ☆16 · Sep 9, 2022 · Updated 3 years ago
- ☆12 · Jun 8, 2021 · Updated 4 years ago
- Lyra: A Benchmark for Turducken-Style Code Generation ☆15 · Apr 22, 2022 · Updated 3 years ago
- Models and datasets for annotated code search. ☆35 · May 22, 2023 · Updated 2 years ago
- ☆15 · Oct 26, 2021 · Updated 4 years ago
- EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering ☆68 · Nov 26, 2021 · Updated 4 years ago
- Author implementation of the paper "Span-based Semantic Parsing for Compositional Generalization" ☆17 · Aug 29, 2021 · Updated 4 years ago
- ☆17 · Oct 4, 2020 · Updated 5 years ago
- Codes for "NAST: A Non-Autoregressive Generator with Word Alignment for Unsupervised Text Style Transfer" (ACL 2021 findings) ☆15 · Nov 3, 2021 · Updated 4 years ago
- Deep learning for time-varying multi-entity datasets ☆17 · May 12, 2018 · Updated 7 years ago
- NAACL 2018 Tutorial: Modelling Natural Language, Programs, and their Intersection ☆101 · May 31, 2018 · Updated 7 years ago
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code. ☆17 · Feb 16, 2026 · Updated 2 weeks ago
- PyTorch library for synthesizing programs from natural language ☆18 · Jul 25, 2024 · Updated last year
- Code for Neural Execution Engines: Learning to Execute Subroutines ☆18 · Jan 11, 2021 · Updated 5 years ago
- Learning to Model Editing Processes ☆26 · Aug 3, 2025 · Updated 7 months ago
- A corpus of Python programs annotated with contracts ☆25 · Oct 16, 2025 · Updated 4 months ago
- Author implementation of the paper "Decoupling Structure and Lexicon for Zero-Shot Semantic Parsing" ☆18 · Nov 2, 2018 · Updated 7 years ago
- This repository is the official implementation of our paper MVP: Multi-task Supervised Pre-training for Natural Language Generation. ☆72 · Nov 1, 2022 · Updated 3 years ago
- Sketch Driven Regular Expression Generation. ☆17 · Apr 26, 2023 · Updated 2 years ago
- "Semantic Evaluation for Text-to-SQL with Distilled Test Suite", EMNLP2020 ☆42 · Dec 1, 2020 · Updated 5 years ago
- Restoring Execution Environments of Jupyter Notebooks ☆21 · May 29, 2023 · Updated 2 years ago
- Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing" ☆20 · Oct 5, 2020 · Updated 5 years ago
- Albert for Conversational Question Answering Challenge ☆22 · Jun 12, 2023 · Updated 2 years ago
- ☆19 · Dec 8, 2022 · Updated 3 years ago
- ☆42 · Jan 11, 2021 · Updated 5 years ago
- A toolkit for pre-processing large source code corpora ☆45 · Sep 30, 2022 · Updated 3 years ago
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation". ☆267 · Oct 30, 2024 · Updated last year
- Official implementation of our work, A Transformer-based Approach for Source Code Summarization [ACL 2020]. ☆195 · May 28, 2022 · Updated 3 years ago