JEMMA: An Extensible Java dataset for Many ML4Code Applications
☆19 Dec 12, 2022 Updated 3 years ago
Alternatives and similar repositories for jemma
Users that are interested in jemma are comparing it to the libraries listed below.
- Learning to Recommend Method Names with Global Context ☆13 Jan 17, 2022 Updated 4 years ago
- Code for generating the JuICe dataset. ☆37 Oct 27, 2021 Updated 4 years ago
- ☆29 Oct 29, 2022 Updated 3 years ago
- ☆24 Dec 16, 2023 Updated 2 years ago
- code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models" ☆12 Dec 13, 2024 Updated last year
- Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023) ☆57 Sep 27, 2024 Updated last year
- ☆15 Oct 2, 2024 Updated last year
- A handwriting generation program based on opentype.js ☆13 Jan 29, 2023 Updated 3 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024 ☆13 Apr 21, 2024 Updated last year
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an… ☆15 May 18, 2022 Updated 3 years ago
- My little corner of the internet for writing notes about papers I read ☆23 Apr 10, 2023 Updated 3 years ago
- Towards converting multilingual source code into one language-agnostic graph representation. ☆48 Mar 22, 2023 Updated 3 years ago
- ☆127 Apr 22, 2023 Updated 2 years ago
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022) ☆76 Jul 16, 2022 Updated 3 years ago
- A set of tools for extracting tokens and ASTs from code ☆22 Jun 5, 2018 Updated 7 years ago
- code for "Natural Language to Code Translation with Execution" ☆41 Nov 2, 2022 Updated 3 years ago
- BioCoder: A Benchmark for Bioinformatics Code Generation with Large Language Models https://arxiv.org/abs/2308.16458 ☆58 Jul 31, 2025 Updated 8 months ago
- We introduce FixEval, a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e… ☆26 Aug 31, 2022 Updated 3 years ago
- Training language models to make programs faster ☆98 Apr 16, 2024 Updated last year
- Datasets is a Java library for conveniently working with machine learning datasets. ☆21 Jun 19, 2018 Updated 7 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023 ☆251 Dec 15, 2023 Updated 2 years ago
- Code for "Typilus: Neural Type Hints" PLDI 2020 ☆62 Feb 8, 2023 Updated 3 years ago
- Code for the paper "Symmetric Machine Theory of Mind", presented at ICML 2022. ☆12 Jul 18, 2022 Updated 3 years ago
- ☆11 Dec 31, 2019 Updated 6 years ago
- [EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search' ☆27 Oct 9, 2023 Updated 2 years ago
- The RunBugRun dataset of executable bugs ☆24 Sep 24, 2025 Updated 6 months ago
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022) ☆86 Mar 20, 2023 Updated 3 years ago
- ☆32 Jul 13, 2022 Updated 3 years ago
- A benchmark for evaluating embeddings of identifiers in source code. ☆22 Aug 23, 2021 Updated 4 years ago
- ☆20 Mar 6, 2023 Updated 3 years ago
- Dynamic Spectral Graph Anomaly Detection accepted by AAAI2025 ☆21 Apr 12, 2025 Updated last year
- ☆15 Oct 26, 2021 Updated 4 years ago
- FaiRR: Faithful and Robust Deductive Reasoning over Natural Language (ACL 2022) ☆13 May 19, 2022 Updated 3 years ago
- Mapping Language to Code in a Programmatic Context ☆80 Jan 27, 2021 Updated 5 years ago
- This repo will contain replication package for the paper "Feeding Trees to Transformers for Code Completion" ☆99 Jun 3, 2022 Updated 3 years ago
- SCoPE: Sentence Content Paragraph Embeddings ☆18 Jul 30, 2019 Updated 6 years ago
- This tool is a Program Dependence Graph generator for a given input file in the programming language Java that can be output as a dot f… ☆23 Dec 29, 2019 Updated 6 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆59 Jan 12, 2023 Updated 3 years ago
- A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI. ☆154 Dec 25, 2024 Updated last year