JEMMA: An Extensible Java dataset for Many ML4Code Applications
☆19Dec 12, 2022Updated 3 years ago
Alternatives and similar repositories for jemma
Users that are interested in jemma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learning to Recommend Method Names with Global Context☆13Jan 17, 2022Updated 4 years ago
- Code for generating the JuICe dataset.☆37Oct 27, 2021Updated 4 years ago
- ☆29Oct 29, 2022Updated 3 years ago
- ☆18Nov 12, 2022Updated 3 years ago
- ☆25Dec 16, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)☆60Sep 27, 2024Updated last year
- ☆15Oct 2, 2024Updated last year
- ☆11Jul 20, 2021Updated 4 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024☆13Apr 21, 2024Updated 2 years ago
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie…☆84Mar 24, 2023Updated 3 years ago
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆15May 18, 2022Updated 4 years ago
- My little corner of the internet for writing notes about papers I read☆25Apr 10, 2023Updated 3 years ago
- 基于opentype.js的手写字生成程序☆26Jan 29, 2023Updated 3 years ago
- Towards converting multilingual source code into one language-agnostic graph representation.☆48Mar 22, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆127Apr 22, 2023Updated 3 years ago
- demonstration for our ACL 2018 paper, "On the Practical Computational Power of Finite Precision RNNs for Language Recognition"☆11May 26, 2019Updated 7 years ago
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆77Jul 16, 2022Updated 3 years ago
- A set of tools for extracting tokens and ASTs from code☆22Jun 5, 2018Updated 8 years ago
- BioCoder: A Benchmark for Bioinformatics Code Generation with Large Language Models https://arxiv.org/abs/2308.16458☆58Jul 31, 2025Updated 11 months ago
- We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…☆26Aug 31, 2022Updated 3 years ago
- Training language models to make programs faster☆101Apr 16, 2024Updated 2 years ago
- Datasets is a Java library for conveniently working with machine learning datasets.☆21Jun 19, 2026Updated last week
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆253Dec 15, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for "Typilus: Neural Type Hints" PLDI 2020☆63Feb 8, 2023Updated 3 years ago
- Code for the paper "Symmetric Machine Theory of Mind", presented at ICML 2022.☆12Jul 18, 2022Updated 3 years ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'☆28Oct 9, 2023Updated 2 years ago
- The RunBugRun dataset of executable bugs☆25Sep 24, 2025Updated 9 months ago
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)☆85Mar 20, 2023Updated 3 years ago
- ☆32Jul 13, 2022Updated 3 years ago
- ☆20Mar 6, 2023Updated 3 years ago
- Dynamic Spectral Graph Anomaly Detection accepted by AAAI2025☆22Apr 12, 2025Updated last year
- ☆15Oct 26, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- FaiRR: Faithful and Robust Deductive Reasoning over Natural Language (ACL 2022)☆14May 19, 2022Updated 4 years ago
- A tag recommender based on SOTA machine learning algorithms to automatically recommending tags to software repositories.☆20May 24, 2022Updated 4 years ago
- Probabilistic Type Inference using Graph Neural Networks☆49Dec 9, 2022Updated 3 years ago
- Mapping Language to Code in a Programmatic Context☆80Jan 27, 2021Updated 5 years ago
- This repo will contain replication package for the paper "Feeding Trees to Transformers for Code Completion"☆100Jun 3, 2022Updated 4 years ago
- Code and data for paper "Detecting Code Clones with Graph Neural Network and Flow-Augmented Abstract Syntax Tree".☆70Feb 13, 2022Updated 4 years ago
- This tool is a Program Dependence Graph generator for a given input file in the programming language Java that can be outputed as a dot f…☆23Dec 29, 2019Updated 6 years ago