sotorrent / db-scriptsLinks
SQL and Bash scripts to import the offical Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references from the BigQuery GitHub data set, and to retrieve data from the SOTorrent dataset for analysis.
☆16Updated 3 months ago
Alternatives and similar repositories for db-scripts
Users that are interested in db-scripts are comparing it to the libraries listed below
Sorting:
- A toolkit for pre-processing large source code corpora☆47Updated 2 years ago
- ☆14Updated last year
- ☆49Updated 5 years ago
- Probabilistic API Mining☆53Updated 7 years ago
- BEE (Bug rEport analyzEr), a tool for structuring and analyzing bug reports☆26Updated last year
- the code for three models introduced in DYNAMIC NEURAL PROGRAM EMBEDDINGS FOR PROGRAM REPAIR (ICLR 18)☆32Updated 7 years ago
- DeepBugs is a framework for learning bug detectors from an existing code corpus.☆151Updated 4 years ago
- ☆49Updated 2 years ago
- Website for Learning from "Big Code"☆29Updated 4 years ago
- src2abs is a tool that abstracts Java source code☆35Updated 6 years ago
- mwcvitkovic / Open-Vocabulary-Learning-on-Source-Code-with-a-Graph-Structured-Cache--Code-PreprocessorLibrary for preprocessing java source code into Augmented ASTs, as per the paper Open Vocabulary Learning on Source Code with a Graph-Str…☆21Updated 6 years ago
- A Systematic Literature Review of Deep Learning in Software Engineering☆19Updated 11 months ago
- Bilateral Neural Network implementation in Tensorflow☆51Updated 6 years ago
- Source code understanding via Machine Learning techniques☆137Updated 2 years ago
- Tree-based Autofolding Software Summarization Algorithm☆42Updated 9 years ago
- ☆23Updated last year
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…☆22Updated 4 years ago
- MLonCode community effort to implement Learning Distributed Representations of Code (https://arxiv.org/pdf/1803.09473.pdf)☆39Updated 6 years ago
- Convert source code into numerical tokens☆65Updated 2 years ago
- Mining tool and large-scale datasets of single statement bug fixes in Python☆17Updated last year
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie…☆84Updated 2 years ago
- Hoppity☆59Updated 4 years ago
- ICSE 2021 Artifact for: Shipwright: A Human-in-the-Loop System for Dockerfile Repair.☆22Updated 4 years ago
- Code to reproduce the experiments in the paper Open Vocabulary Learning on Source Code with a Graph-Structured Cache☆21Updated 6 years ago
- Explainable AI for Software Engineering: A Hands-on Guide on How to Make Software Analytics More Practical, Explainable, and Actionable (…☆26Updated 3 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Updated 5 years ago
- ☆25Updated 4 years ago
- A dynamic method for detecting faults in incremental and parallel builds.☆19Updated 3 years ago
- ☆13Updated 2 years ago
- Hosts our tool for mining simple "stupid'' bugs (SStuBs).☆38Updated 3 years ago