src-d / reading-club
Paper reading club at source{d}
☆115Updated 4 years ago
Related projects: ⓘ
- sourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees☆141Updated 5 years ago
- MLonCode community effort to implement Learning Distributed Representations of Code (https://arxiv.org/pdf/1803.09473.pdf)☆40Updated 5 years ago
- 58069 Java source code diffs. http://arxiv.org/pdf/1807.03200☆91Updated 5 years ago
- CodRep 2019 edition.☆20Updated 4 years ago
- Vecino is a command line application to discover Git repositories which are similar to the one that the user provides.☆46Updated 5 years ago
- Source code for the Naturalize project☆56Updated 9 years ago
- source{d} datasets ("big code") for source code analysis and machine learning on source code☆322Updated 4 years ago
- Open paper reading club @ JetBrains☆35Updated 3 months ago
- Utilities used by the Deep Program Understanding team☆102Updated last year
- A tool for mining commits from Git repositories and diffs to automatically extract code change pattern instances and features with ast a…☆92Updated 3 months ago
- Finding similar repositories on GitHub☆45Updated last year
- GenProg: heuristic, GP-based automatic program repair for C.☆89Updated 3 years ago
- Demonstration of the path-extraction process shown in the paper "A General Path-Based Representation for Predicting Program Properties"☆24Updated 3 years ago
- [ICSE'18] Hierarchical Learning of Cross-Language Mappings through Distributed Vector Representations for Code☆22Updated 6 years ago
- Python library to share machine learning models easily and reliably.☆18Updated 4 years ago
- Website for Learning from "Big Code"☆29Updated 3 years ago
- Babelfish Python client☆16Updated 4 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Updated 4 years ago
- Machine learning models for MLonCode trained using the source{d} stack☆19Updated 4 years ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆288Updated last month
- Lookout Style Analyzer: fixing code formatting and typos during code reviews☆32Updated last year
- Your library for dynamic language modeling☆66Updated 5 years ago
- evaluation dataset consisting of natural language query and code snippet pairs☆123Updated 4 months ago
- source{d} MLonCode foundation - core algorithms and models.☆14Updated 4 years ago
- Code for "Typilus: Neural Type Hints" PLDI 2020☆59Updated last year
- Babelfish documentation (GitBook)☆41Updated 4 years ago
- jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source co…☆71Updated 5 years ago
- A library for mining of path-based representations of code (and more)☆280Updated 9 months ago
- Perspectives on Data Science for Software Engineering☆59Updated last year
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie…☆83Updated last year