JEMMA: An Extensible Java dataset for Many ML4Code Applications
☆19Dec 12, 2022Updated 3 years ago
Alternatives and similar repositories for jemma
Users that are interested in jemma are comparing it to the libraries listed below.
- Learning to Recommend Method Names with Global Context☆13Jan 17, 2022Updated 4 years ago
- ☆29Oct 29, 2022Updated 3 years ago
- ☆18Nov 12, 2022Updated 3 years ago
- code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"☆12Dec 13, 2024Updated last year
- Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)☆57Sep 27, 2024Updated last year
- ☆16Oct 2, 2024Updated last year
- ☆11Jul 20, 2021Updated 4 years ago
- Handwritten text generation program based on opentype.js☆13Jan 29, 2023Updated 3 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024☆13Apr 21, 2024Updated last year
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie…☆84Mar 24, 2023Updated 2 years ago
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆15May 18, 2022Updated 3 years ago
- Towards converting multilingual source code into one language-agnostic graph representation.☆48Mar 22, 2023Updated 3 years ago
- ☆127Apr 22, 2023Updated 2 years ago
- Improving Code Readability Classification using Convolutional Neural Networks☆10Apr 18, 2018Updated 7 years ago
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆76Jul 16, 2022Updated 3 years ago
- code for "Natural Language to Code Translation with Execution"☆41Nov 2, 2022Updated 3 years ago
- We introduce FixEval, a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…☆26Aug 31, 2022Updated 3 years ago
- Training language models to make programs faster☆98Apr 16, 2024Updated last year
- Datasets is a Java library for conveniently working with machine learning datasets.☆21Jun 19, 2018Updated 7 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆251Dec 15, 2023Updated 2 years ago
- Code for "Typilus: Neural Type Hints" PLDI 2020☆62Feb 8, 2023Updated 3 years ago
- Code for the paper "Symmetric Machine Theory of Mind", presented at ICML 2022.☆12Jul 18, 2022Updated 3 years ago
- Using SVF in Python Projects☆15Updated this week
- ☆11Dec 31, 2019Updated 6 years ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆27Oct 9, 2023Updated 2 years ago
- The RunBugRun dataset of executable bugs☆24Sep 24, 2025Updated 5 months ago
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)☆86Mar 20, 2023Updated 3 years ago
- ☆32Jul 13, 2022Updated 3 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Aug 23, 2021Updated 4 years ago
- ☆20Mar 6, 2023Updated 3 years ago
- Dynamic Spectral Graph Anomaly Detection accepted by AAAI2025☆21Apr 12, 2025Updated 11 months ago
- ☆15Oct 26, 2021Updated 4 years ago
- FaiRR: Faithful and Robust Deductive Reasoning over Natural Language (ACL 2022)☆13May 19, 2022Updated 3 years ago
- A tag recommender based on SOTA machine learning algorithms that automatically recommends tags for software repositories.☆20May 24, 2022Updated 3 years ago
- Mapping Language to Code in a Programmatic Context☆80Jan 27, 2021Updated 5 years ago
- This repo will contain the replication package for the paper "Feeding Trees to Transformers for Code Completion"☆99Jun 3, 2022Updated 3 years ago
- Code and data for paper "Detecting Code Clones with Graph Neural Network and Flow-Augmented Abstract Syntax Tree".☆70Feb 13, 2022Updated 4 years ago
- SCoPE: Sentence Content Paragraph Embeddings☆18Jul 30, 2019Updated 6 years ago
- ☆24Jan 19, 2022Updated 4 years ago