src-d / ml
sourced.ml is a library and a set of command-line tools for building and applying machine learning models on top of Universal Abstract Syntax Trees.
☆141 Updated 5 years ago
Related projects
Alternatives and complementary repositories for ml
- MLonCode community effort to implement Learning Distributed Representations of Code (https://arxiv.org/pdf/1803.09473.pdf) ☆40 Updated 6 years ago
- Paper reading club at source{d} ☆115 Updated 4 years ago
- Vecino is a command line application to discover Git repositories which are similar to the one that the user provides. ☆46 Updated 5 years ago
- source{d} MLonCode foundation - core algorithms and models. ☆14 Updated 5 years ago
- Babelfish Python client ☆16 Updated 5 years ago
- 58069 Java source code diffs. http://arxiv.org/pdf/1807.03200 ☆91 Updated 5 years ago
- Machine learning models for MLonCode trained using the source{d} stack ☆19 Updated 5 years ago
- Learning to Auto-Complete using RNN Language Models ☆156 Updated 7 years ago
- [ICSE'18] Hierarchical Learning of Cross-Language Mappings through Distributed Vector Representations for Code ☆22 Updated 6 years ago
- Babelfish documentation (GitBook) ☆41 Updated 4 years ago
- Tools, services and applications for source code analysis and search ☆61 Updated 9 years ago
- Source code for the Naturalize project ☆56 Updated 9 years ago
- Python library to share machine learning models easily and reliably. ☆18 Updated 5 years ago
- A Python 3 module that provides functions for splitting identifiers found in source code files. ☆48 Updated last year
- source{d} datasets ("big code") for source code analysis and machine learning on source code ☆323 Updated 4 years ago
- CodRep 2019 edition. ☆20 Updated 4 years ago
- Demonstration of the path-extraction process shown in the paper "A General Path-Based Representation for Predicting Program Properties" ☆24 Updated 3 years ago
- Advanced similarity and duplicate source code proof of concept for our research efforts. ☆52 Updated 2 years ago
- Utilities used by the Deep Program Understanding team ☆102 Updated last year
- Repository for the code of the "A Convolutional Attention Network for Extreme Summarization of Source Code" paper ☆119 Updated 8 years ago
- Lookout Style Analyzer: fixing code formatting and typos during code reviews ☆32 Updated last year
- Summarizing Source Code using a Neural Attention Model - CODENN ☆236 Updated last year
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstracted Symbolic Traces" ☆22 Updated 4 years ago
- A tool for mining commits from Git repositories and diffs to automatically extract code change pattern instances and features with ast a… ☆92 Updated 5 months ago
- DeepCS: Deep Code Search ☆279 Updated 2 years ago
- source{d} Community Edition (CE) ☆188 Updated 5 years ago
- Tree-based Autofolding Software Summarization Algorithm ☆42 Updated 8 years ago
- ☆50 Updated 4 years ago
- Code completion using machine learning :) ☆32 Updated 8 years ago
- jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source co… ☆71 Updated 5 years ago