sourced.ml is a library and a set of command-line tools for building and applying machine learning models on top of Universal Abstract Syntax Trees
☆143May 22, 2019Updated 6 years ago
Alternatives and similar repositories for ml
Users that are interested in ml are comparing it to the libraries listed below
- Vecino is a command line application to discover Git repositories which are similar to the one that the user provides.☆49Aug 20, 2019Updated 6 years ago
- source{d} MLonCode foundation - core algorithms and models.☆14Oct 17, 2019Updated 6 years ago
- Advanced similarity and duplicate source code detection at scale.☆57Jun 6, 2019Updated 6 years ago
- Machine learning models for MLonCode trained using the source{d} stack☆19Oct 30, 2019Updated 6 years ago
- Babelfish Python client☆17Nov 6, 2019Updated 6 years ago
- Python library to share machine learning models easily and reliably.☆18Nov 5, 2019Updated 6 years ago
- Advanced similarity and duplicate source code detection proof of concept for our research efforts.☆51Sep 5, 2022Updated 3 years ago
- jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source co…☆71Feb 13, 2019Updated 7 years ago
- Assisted code review, running custom code analyzers on pull requests☆152Aug 3, 2021Updated 4 years ago
- Probabilistic Sequence Mining☆46Apr 25, 2018Updated 7 years ago
- Utilities used by the Deep Program Understanding team☆104Jun 12, 2023Updated 2 years ago
- Paper reading club at source{d}☆116Dec 10, 2019Updated 6 years ago
- MLonCode community effort to implement Learning Distributed Representations of Code (https://arxiv.org/pdf/1803.09473.pdf)☆39Oct 18, 2018Updated 7 years ago
- source{d} datasets ("big code") for source code analysis and machine learning on source code☆343Nov 27, 2019Updated 6 years ago
- Code to reproduce the experiments in the paper Open Vocabulary Learning on Source Code with a Graph-Structured Cache☆21Apr 15, 2019Updated 6 years ago
- source{d} extension to match Git signatures to real people.☆17Nov 12, 2019Updated 6 years ago
- Lookout Style Analyzer: fixing code formatting and typos during code reviews☆33Nov 23, 2022Updated 3 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstracted Symbolic Traces"☆22Jun 5, 2020Updated 5 years ago
- Software vulnerabilities data set☆25Mar 4, 2020Updated 6 years ago
- code2vec: Learning Distributed Representations of Code☆14Jun 27, 2018Updated 7 years ago
- Tracking events, CfPs, abstracts, slides, and all other event-related things☆22Oct 4, 2019Updated 6 years ago
- An IntelliJ IDEA plugin that suggests better method names☆10Dec 4, 2019Updated 6 years ago
- DeepCS: Deep Code Search☆283May 26, 2022Updated 3 years ago
- A simple PyTorch implementation of Gated Graph Neural Networks☆59Sep 18, 2018Updated 7 years ago
- Probabilistic API Mining☆53Jan 8, 2018Updated 8 years ago
- Web client for Babelfish server☆21Dec 9, 2022Updated 3 years ago
- [Deprecated] Source Code Generation using Sequence Generative Adversarial Networks☆75Jan 7, 2017Updated 9 years ago
- Library for preprocessing Java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…☆21Oct 22, 2018Updated 7 years ago
- Structured Information on State and Evolution of Dockerfiles - Online Appendix☆10Mar 16, 2018Updated 8 years ago
- ☆20Nov 6, 2019Updated 6 years ago
- C# Data Extraction for "Learning to Represent Edits"☆27Nov 3, 2018Updated 7 years ago
- Tools for speech processing, keyword spotting☆17Mar 11, 2020Updated 6 years ago
- ☆12May 27, 2021Updated 4 years ago
- ☆10Jul 28, 2022Updated 3 years ago
- Website for Learning from "Big Code"☆30Jun 19, 2021Updated 4 years ago
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code"☆1,143Sep 20, 2023Updated 2 years ago
- gitbase web client; source{d} CE comes with a new UI, check it at https://docs.sourced.tech/community-edition/☆60Oct 9, 2019Updated 6 years ago
- A Source Code Tokenizer☆14Oct 30, 2024Updated last year
- Maximal Divergence Sequential Autoencoder for Binary Software Vulnerability Detection☆21Feb 27, 2019Updated 7 years ago