src-d / ml
sourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees
☆141Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for ml
- Paper reading club at source{d}☆115Updated 4 years ago
- MLonCode community effort to implement Learning Distributed Representations of Code (https://arxiv.org/pdf/1803.09473.pdf)☆40Updated 6 years ago
- Vecino is a command line application to discover Git repositories which are similar to the one that the user provides.☆46Updated 5 years ago
- source{d} MLonCode foundation - core algorithms and models.☆14Updated 5 years ago
- Machine learning models for MLonCode trained using the source{d} stack☆19Updated 5 years ago
- 58069 Java source code diffs. http://arxiv.org/pdf/1807.03200☆91Updated 5 years ago
- Python library to share machine learning models easily and reliably.☆18Updated 5 years ago
- Utilities used by the Deep Program Understanding team☆102Updated last year
- Source code for the Naturalize project☆56Updated 9 years ago
- Lookout Style Analyzer: fixing code formatting and typos during code reviews☆32Updated last year
- A tool for mining commits from Git repositories and diffs to automatically extract code change pattern instances and features with ast a…☆92Updated last week
- Babelfish Python client☆16Updated 5 years ago
- Repository for the code of the "A Convolutional Attention Network for Extreme Summarization of Source Code" paper☆120Updated 8 years ago
- Advanced similarity and duplicate source code proof of concept for our research efforts.☆52Updated 2 years ago
- Learning to Auto-Complete using RNN Language Models☆156Updated 7 years ago
- Babelfish documentation (GitBook)☆41Updated 5 years ago
- jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source co…☆71Updated 5 years ago
- CodRep 2019 edition.☆20Updated 5 years ago
- Tree-based Autofolding Software Summarization Algorithm☆42Updated 8 years ago
- evaluation dataset consisting of natural language query and code snippet pairs☆123Updated 6 months ago
- Demonstration of the path-extraction process shown in the paper "A General Path-Based Representation for Predicting Program Properties"☆24Updated 3 years ago
- Probabilistic API Mining☆53Updated 6 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Updated 4 years ago
- DeepCS: Deep Code Search☆279Updated 2 years ago
- DeepBugs is a framework for learning bug detectors from an existing code corpus.☆148Updated 3 years ago
- Smelling smells using Deep Learning☆44Updated 3 years ago
- Code completion using machine learning :)☆32Updated 8 years ago
- Website for Learning from "Big Code"☆29Updated 3 years ago
- Finding similar repositories on GitHub☆46Updated last year
- Summarizing Source Code using a Neural Attention Model - CODENN☆236Updated last year