sourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees
☆143May 22, 2019Updated 6 years ago
Alternatives and similar repositories for ml
Users that are interested in ml are comparing it to the libraries listed below
Sorting:
- source{d} MLonCode foundation - core algorithms and models.☆14Oct 17, 2019Updated 6 years ago
- Vecino is a command line application to discover Git repositories which are similar to the one that the user provides.☆49Aug 20, 2019Updated 6 years ago
- ☆12Nov 17, 2017Updated 8 years ago
- Babelfish Python client☆17Nov 6, 2019Updated 6 years ago
- Machine learning models for MLonCode trained using the source{d} stack☆19Oct 30, 2019Updated 6 years ago
- Advanced similarity and duplicate source code at scale.☆57Jun 6, 2019Updated 6 years ago
- jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source co…☆71Feb 13, 2019Updated 7 years ago
- Tree-based Autofolding Software Summarization Algorithm☆43Jul 30, 2016Updated 9 years ago
- Advanced similarity and duplicate source code proof of concept for our research efforts.☆52Sep 5, 2022Updated 3 years ago
- code2vec: Learning Distributed Representations of Code☆14Jun 27, 2018Updated 7 years ago
- Source code for the Naturalize project☆56Sep 5, 2015Updated 10 years ago
- Babelfish driver SDK☆23Nov 18, 2019Updated 6 years ago
- Lookout Style Analyzer: fixing code formatting and typos during code reviews☆33Nov 23, 2022Updated 3 years ago
- Code to reproduce the experiments in the paper Open Vocabulary Learning on Source Code with a Graph-Structured Cache☆21Apr 15, 2019Updated 6 years ago
- A self-hosted server for source code parsing☆367Updated this week
- Utilities used by the Deep Program Understanding team☆104Jun 12, 2023Updated 2 years ago
- Probabilistic Sequence Mining☆46Apr 25, 2018Updated 7 years ago
- An IntelliJ IDEA plugin that allows to get suggestions for better method names☆10Dec 4, 2019Updated 6 years ago
- Paper reading club at source{d}☆116Dec 10, 2019Updated 6 years ago
- Cool links & research papers related to Machine Learning applied to source code (MLonCode)☆6,524Dec 3, 2020Updated 5 years ago
- 58069 Java source code diffs. http://arxiv.org/pdf/1807.03200☆94Jul 21, 2019Updated 6 years ago
- Code for paper "Lancer: Your Code Tell Me What You Need"☆11Jun 17, 2022Updated 3 years ago
- source{d} extension to match Git signatures to real people.☆17Nov 12, 2019Updated 6 years ago
- source{d} datasets ("big code") for source code analysis and machine learning on source code☆343Nov 27, 2019Updated 6 years ago
- MLonCode community effort to implement Learning Distributed Representations of Code (https://arxiv.org/pdf/1803.09473.pdf)☆39Oct 18, 2018Updated 7 years ago
- DeepCS: Deep Code Search☆283May 26, 2022Updated 3 years ago
- Library for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…☆21Oct 22, 2018Updated 7 years ago
- Neural Code Comprehension: A Learnable Representation of Code Semantics☆216Nov 22, 2024Updated last year
- ☆40Jun 4, 2022Updated 3 years ago
- [DISCONTINUED] Go to https://github.com/src-d/sourced-ce/☆217Oct 9, 2019Updated 6 years ago
- Website for Learning from "Big Code"☆30Jun 19, 2021Updated 4 years ago
- ☆50Feb 12, 2020Updated 6 years ago
- A Dataset of 600k Java Source Code Changes Categorized by Diff Size http://arxiv.org/pdf/2108.04631☆23Mar 22, 2024Updated last year
- A Source Code Tokenizer☆14Oct 30, 2024Updated last year
- Dataset and code corresponding to Associating Natural Language Comment and Source Code Entities (AAAI 2020)☆20Oct 24, 2020Updated 5 years ago
- gitbase web client; source{d} CE comes with a new UI, check it at https://docs.sourced.tech/community-edition/☆60Oct 9, 2019Updated 6 years ago
- [Deprecated] Source Code Generation using Sequence Generative Adversarial Networks☆75Jan 7, 2017Updated 9 years ago
- Estimating Body Fat Using Computer Vision (openCV2, Python)☆23Dec 18, 2014Updated 11 years ago
- Software vulnerabilities data set☆25Mar 4, 2020Updated 5 years ago