cppminer produces a code2seq compatible datasets from C++ code bases.
☆23Apr 5, 2020Updated 6 years ago
Alternatives and similar repositories for cppminer
Users that are interested in cppminer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library for mining of path-based representations of code (and more)☆299Nov 7, 2025Updated 5 months ago
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code"☆564Jul 12, 2025Updated 8 months ago
- Code for the paper "A Structural Model for Contextual Code Changes"☆32Oct 25, 2023Updated 2 years ago
- Tracking events, CfPs, abstracts, slides, and all other even related things☆22Oct 4, 2019Updated 6 years ago
- Hoppity☆60Nov 25, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- TensorFlow code for the neural network presented in the paper: "Structural Language Models of Code" (ICML'2020)☆92May 20, 2022Updated 3 years ago
- Structured Information on State and Evolution of Dockerfiles - Online Appendix☆10Mar 16, 2018Updated 8 years ago
- The dataset and source code for CugLM☆15Sep 1, 2020Updated 5 years ago
- demonstration for our ACL 2018 paper, "On the Practical Computational Power of Finite Precision RNNs for Language Recognition"☆11May 26, 2019Updated 6 years ago
- ☆29Oct 29, 2022Updated 3 years ago
- ☆69May 30, 2025Updated 10 months ago
- Library for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…☆21Oct 22, 2018Updated 7 years ago
- Code Generation as a Dual Task of Code Summarization.☆30Jun 28, 2021Updated 4 years ago
- Sequence-to-Sequence Learning for End-to-End Program Repair (IEEE TSE 2019). Open-science repo. http://arxiv.org/pdf/1901.01808☆86Jun 9, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆11Dec 31, 2019Updated 6 years ago
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…☆22Aug 19, 2020Updated 5 years ago
- Neural Variable Renaming for Decompiled Binaries☆44May 4, 2020Updated 5 years ago
- MODIT: On Multi-Modal Learning of Editing Source Code.☆20Apr 24, 2021Updated 4 years ago
- C# Data Extraction for "Learning to Represent Edits"☆27Nov 3, 2018Updated 7 years ago
- ☆18Sep 12, 2019Updated 6 years ago
- Neural Paraphrase Generation based on OpenNMT-py☆12Jan 2, 2018Updated 8 years ago
- ☆13Mar 21, 2019Updated 7 years ago
- ☆12Oct 29, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆14Jul 21, 2020Updated 5 years ago
- Jess is short for Joern extended by Semantic Slicing. This tool allows you to import C code into a Code Property Graph, and then compute …☆17May 22, 2024Updated last year
- borges collects and stores Git repositories.☆53Oct 11, 2019Updated 6 years ago
- Code and plugin for paper "Automated Query Reformulation for Efficient Search based on Query Logs From Stack Overflow“☆16Nov 19, 2022Updated 3 years ago
- ManyTypes4Py: A benchmark Python dataset for machine learning-based type inference☆24Mar 27, 2022Updated 4 years ago
- Here, we open source our measurement dataset and source code on IFTTT☆11Oct 23, 2018Updated 7 years ago
- DeepBugs is a framework for learning bug detectors from an existing code corpus.☆152Apr 7, 2021Updated 5 years ago
- Finding Fix Recommendations for Dockerfiles☆18Sep 26, 2023Updated 2 years ago
- The implementation of the IJCAI 2018 paper: Code Completion with Neural Attention and Pointer Networks☆18Sep 11, 2019Updated 6 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code related to "Learning Continuous Semantic Representations of Symbolic Expressions" project.☆35Dec 8, 2016Updated 9 years ago
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code"☆1,144Sep 20, 2023Updated 2 years ago
- 计算机术语库 csv格式☆13Jul 25, 2017Updated 8 years ago
- A dataset for natural language code search.☆14Feb 13, 2020Updated 6 years ago
- Contrastive Code Representation Learning: functionality-based JavaScript embeddings through self-supervised learning☆169Dec 26, 2021Updated 4 years ago
- JEMMA: An Extensible Java dataset for Many ML4Code Applications☆19Dec 12, 2022Updated 3 years ago
- The dataset in the paper "Detecting '0-Day' Vulnerability: An Empirical Study of Secret Security Patch in OSS", which appears in the 2019…☆13Aug 9, 2023Updated 2 years ago