A set of tools for extracting tokens and ASTs from code
☆22Jun 5, 2018Updated 7 years ago
Alternatives and similar repositories for codemining-core
Users that are interested in codemining-core are comparing it to the libraries listed below
Sorting:
- Code for paper "Lancer: Your Code Tell Me What You Need"☆11Jun 17, 2022Updated 3 years ago
- Probabilistic Itemset Mining☆19Jun 22, 2016Updated 9 years ago
- code for "Implant Global and Local Hierarchy Information to Sequence based Code Representation Models"☆12Dec 13, 2024Updated last year
- Source code summarization using the Software Word Usage Model (SWUM)☆16Apr 27, 2016Updated 9 years ago
- Repository for Deep API Learning (DeepAPI)☆56Dec 3, 2021Updated 4 years ago
- Tree-based Autofolding Software Summarization Algorithm☆43Jul 30, 2016Updated 9 years ago
- Code and data for the paper "A Neural Architecture for Generating Natural Language Descriptions from Source Code Changes"☆67Apr 18, 2017Updated 8 years ago
- ☆15Jul 27, 2023Updated 2 years ago
- PyTorch library for synthesizing programs from natural language☆18Jul 25, 2024Updated last year
- JEMMA: An Extensible Java dataset for Many ML4Code Applications☆19Dec 12, 2022Updated 3 years ago
- Dataset and code corresponding to Associating Natural Language Comment and Source Code Entities (AAAI 2020)☆20Oct 24, 2020Updated 5 years ago
- ☆20Feb 20, 2017Updated 9 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Jun 5, 2020Updated 5 years ago
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie…☆84Mar 24, 2023Updated 2 years ago
- Probabilistic API Mining☆53Jan 8, 2018Updated 8 years ago
- [UNMAINTAINED] A PyTorch Implementation of Gated Graph Sequence Neural Networks (GGNN) for Graph Classification☆20Mar 19, 2019Updated 6 years ago
- ☆25Jul 12, 2017Updated 8 years ago
- Code for "Typilus: Neural Type Hints" PLDI 2020☆62Feb 8, 2023Updated 3 years ago
- Library for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…☆21Oct 22, 2018Updated 7 years ago
- Tensorflow Implementation of Improving Variational Encoder-Decoders in Dialogue Generation☆27Mar 27, 2018Updated 7 years ago
- A System for Debloating C/C++ Programs☆31Jul 16, 2021Updated 4 years ago
- Website for Learning from "Big Code"☆30Jun 19, 2021Updated 4 years ago
- A syntactic neural model for parsing natural language to executable code☆186Nov 12, 2022Updated 3 years ago
- Code related to "Learning Continuous Semantic Representations of Symbolic Expressions" project.☆35Dec 8, 2016Updated 9 years ago
- source code for 'Improving automatic source code summarization via deep reinforcement learning'☆78Jun 2, 2021Updated 4 years ago
- Mapping Language to Code in a Programmatic Context☆80Jan 27, 2021Updated 5 years ago
- Neural Code Translator provides instructions, datasets, and a deep learning infrastructure (based on seq2seq) that aims at learning code …☆38Apr 14, 2019Updated 6 years ago
- Data and Code for Reproducing "Global Relational Models of Source Code"☆85May 10, 2021Updated 4 years ago
- A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI.☆154Dec 25, 2024Updated last year
- Big Data and Machine Intelligence, Spring 2021.☆12Jul 2, 2021Updated 4 years ago
- Implementations of Influential Recommender System☆11Oct 29, 2024Updated last year
- Structured Information on State and Evolution of Dockerfiles - Online Appendix☆10Mar 16, 2018Updated 7 years ago
- A Python framework that uses machine learning algorithms to implement the metadata recovery attack against obfuscated programs.☆11Jul 25, 2016Updated 9 years ago
- Various Arduino project codes made available by me.☆10Sep 30, 2016Updated 9 years ago
- Makes it easy to convert Python data structures to JSON strings suitable for flot series and options☆25Sep 26, 2012Updated 13 years ago
- src2abs is a tool that abstracts Java source code☆36Apr 10, 2019Updated 6 years ago
- StaQC: a systematically mined dataset containing around 148K Python and 120K SQL domain question-code pairs, as described in "StaQC: A Sy…☆172Aug 28, 2021Updated 4 years ago
- ☆11Aug 24, 2024Updated last year
- A library for parsing security advisories☆13Feb 5, 2026Updated 3 weeks ago