JEMMA: An Extensible Java dataset for Many ML4Code Applications
☆19Dec 12, 2022Updated 3 years ago
Alternatives and similar repositories for jemma
Users that are interested in jemma are comparing it to the libraries listed below
Sorting:
- ☆11Jul 20, 2021Updated 4 years ago
- code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"☆12Dec 13, 2024Updated last year
- Learning to Recommend Method Names with Global Context☆13Jan 17, 2022Updated 4 years ago
- ☆29Oct 29, 2022Updated 3 years ago
- code for "Natural Language to Code Translation with Execution"☆41Nov 2, 2022Updated 3 years ago
- ☆16Oct 2, 2024Updated last year
- Code for generating the JuICe dataset.☆37Oct 27, 2021Updated 4 years ago
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆15May 18, 2022Updated 3 years ago
- A set of tools for extracting tokens and ASTs from code☆22Jun 5, 2018Updated 7 years ago
- ☆20Mar 6, 2023Updated 2 years ago
- ☆24Dec 16, 2023Updated 2 years ago
- Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)☆57Sep 27, 2024Updated last year
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- Towards converting multilingual source code into one language-agnostic graph representation.☆48Mar 22, 2023Updated 2 years ago
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie…☆84Mar 24, 2023Updated 2 years ago
- ☆18Nov 12, 2022Updated 3 years ago
- We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…☆26Aug 31, 2022Updated 3 years ago
- The RunBugRun dataset of executable bugs☆23Sep 24, 2025Updated 5 months ago
- Training language models to make programs faster☆98Apr 16, 2024Updated last year
- ☆24Jan 19, 2022Updated 4 years ago
- The code of our paper "Misbehaviour Prediction for Autonomous Driving Systems", including our improved Udacity simulator☆21Jun 30, 2021Updated 4 years ago
- ☆24Oct 15, 2023Updated 2 years ago
- My little corner of the internet for writing notes about papers I read☆22Apr 10, 2023Updated 2 years ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆58Mar 20, 2024Updated last year
- ☆28Apr 4, 2022Updated 3 years ago
- ☆61Dec 21, 2023Updated 2 years ago
- This repo will contain replication package for the paper "Feeding Trees to Transformers for Code Completion"☆99Jun 3, 2022Updated 3 years ago
- This tool is a Program Dependence Graph generator for a given input file in the programming language Java that can be outputed as a dot f…☆23Dec 29, 2019Updated 6 years ago
- Replication package of a paper "Large Language Models are Few-shot Testers: Exploring LLM-based General Bug Reproduction"☆26Sep 7, 2023Updated 2 years ago
- ☆22Nov 17, 2021Updated 4 years ago
- Library for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…☆21Oct 22, 2018Updated 7 years ago
- A powerful relational representation of source code☆33Sep 5, 2023Updated 2 years ago
- ☆126Apr 22, 2023Updated 2 years ago
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆76Jul 16, 2022Updated 3 years ago
- ☆32Jul 13, 2022Updated 3 years ago
- Addressing the problem of predicting crime occurrence based on historic records☆11Nov 27, 2019Updated 6 years ago
- A javac plugin for extracting a feature graph for plugging in to machine learning models☆28Jan 20, 2021Updated 5 years ago
- Hoppity☆60Nov 25, 2020Updated 5 years ago
- ☆33Jan 15, 2026Updated last month