JEMMA: An Extensible Java dataset for Many ML4Code Applications
☆19 · Dec 12, 2022 · Updated 3 years ago
Alternatives and similar repositories for jemma
Users that are interested in jemma are comparing it to the libraries listed below.
- Learning to Recommend Method Names with Global Context · ☆13 · Jan 17, 2022 · Updated 4 years ago
- Code for generating the JuICe dataset. · ☆37 · Oct 27, 2021 · Updated 4 years ago
- ☆29 · Oct 29, 2022 · Updated 3 years ago
- ☆18 · Nov 12, 2022 · Updated 3 years ago
- ☆24 · Dec 16, 2023 · Updated 2 years ago
- Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023) · ☆59 · Sep 27, 2024 · Updated last year
- ☆15 · Oct 2, 2024 · Updated last year
- ☆11 · Jul 20, 2021 · Updated 4 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024 · ☆13 · Apr 21, 2024 · Updated 2 years ago
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie… · ☆84 · Mar 24, 2023 · Updated 3 years ago
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an… · ☆15 · May 18, 2022 · Updated 3 years ago
- Towards converting multilingual source code into one language-agnostic graph representation. · ☆48 · Mar 22, 2023 · Updated 3 years ago
- ☆127 · Apr 22, 2023 · Updated 3 years ago
- demonstration for our ACL 2018 paper, "On the Practical Computational Power of Finite Precision RNNs for Language Recognition" · ☆11 · May 26, 2019 · Updated 6 years ago
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022) · ☆76 · Jul 16, 2022 · Updated 3 years ago
- ReadMe++: A Multi-domain Multilingual Dataset for Readability Assessment · ☆12 · Apr 15, 2025 · Updated last year
- This repository contains the code, the dataset and the experimental results related to the paper "Vulnerabilities in AI Code Generators: … · ☆14 · Aug 5, 2024 · Updated last year
- A set of tools for extracting tokens and ASTs from code · ☆22 · Jun 5, 2018 · Updated 7 years ago
- BioCoder: A Benchmark for Bioinformatics Code Generation with Large Language Models https://arxiv.org/abs/2308.16458 · ☆58 · Jul 31, 2025 · Updated 9 months ago
- We introduce FixEval, a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e… · ☆26 · Aug 31, 2022 · Updated 3 years ago
- Training language models to make programs faster · ☆98 · Apr 16, 2024 · Updated 2 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023 · ☆251 · Dec 15, 2023 · Updated 2 years ago
- Code for "Typilus: Neural Type Hints" PLDI 2020 · ☆62 · Feb 8, 2023 · Updated 3 years ago
- ☆11 · Dec 31, 2019 · Updated 6 years ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search' · ☆27 · Oct 9, 2023 · Updated 2 years ago
- The RunBugRun dataset of executable bugs · ☆24 · Sep 24, 2025 · Updated 7 months ago
- ☆20 · Mar 6, 2023 · Updated 3 years ago
- Dynamic Spectral Graph Anomaly Detection accepted by AAAI2025 · ☆22 · Apr 12, 2025 · Updated last year
- ☆15 · Oct 26, 2021 · Updated 4 years ago
- Using SVF in Python Projects · ☆19 · Updated this week
- FaiRR: Faithful and Robust Deductive Reasoning over Natural Language (ACL 2022) · ☆13 · May 19, 2022 · Updated 3 years ago
- Probabilistic Type Inference using Graph Neural Networks · ☆50 · Dec 9, 2022 · Updated 3 years ago
- Mapping Language to Code in a Programmatic Context · ☆80 · Jan 27, 2021 · Updated 5 years ago
- This repo will contain replication package for the paper "Feeding Trees to Transformers for Code Completion" · ☆99 · Jun 3, 2022 · Updated 3 years ago
- Code and data for paper "Detecting Code Clones with Graph Neural Network and Flow-Augmented Abstract Syntax Tree". · ☆70 · Feb 13, 2022 · Updated 4 years ago
- This tool is a Program Dependence Graph generator for a given input file in the programming language Java that can be output as a dot f… · ☆23 · Dec 29, 2019 · Updated 6 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" · ☆59 · Jan 12, 2023 · Updated 3 years ago
- A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI. · ☆156 · Dec 25, 2024 · Updated last year
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models" · ☆58 · Mar 20, 2024 · Updated 2 years ago