sotorrent / db-scripts
SQL and Bash scripts to import the offical Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references from the BigQuery GitHub data set, and to retrieve data from the SOTorrent dataset for analysis.
☆15Updated last month
Related projects ⓘ
Alternatives and complementary repositories for db-scripts
- A Systematic Literature Review of Deep Learning in Software Engineering☆19Updated 2 months ago
- an implementation of "code2vec: Learning Distributed Representations of Code"☆29Updated 4 months ago
- AST factorization: transformation AST of Kotlin source code to a vector☆11Updated 5 years ago
- MLonCode community effort to implement Learning Distributed Representations of Code (https://arxiv.org/pdf/1803.09473.pdf)☆40Updated 6 years ago
- Smelling smells using Deep Learning☆44Updated 3 years ago
- Source code for the Naturalize project☆56Updated 9 years ago
- code2vec: Learning Distributed Representations of Code☆14Updated 6 years ago
- ☆10Updated 4 years ago
- ☆11Updated 3 years ago
- Characterizing the natural language descriptions in software logging statements [ASE'18]☆17Updated 5 years ago
- Tree-based Autofolding Software Summarization Algorithm☆42Updated 8 years ago
- Explainable AI for Software Engineering: A Hands-on Guide on How to Make Software Analytics More Practical, Explainable, and Actionable (…☆26Updated 3 years ago
- Babelfish Python client☆16Updated 5 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Updated 4 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Updated 3 years ago
- ICSE'18: Tuning Smote☆11Updated 6 years ago
- ☆16Updated 4 years ago
- ☆11Updated 5 months ago
- C# Data Extraction for "Learning to Represent Edits"☆27Updated 6 years ago
- A tool for mining graph-based change patterns in Python code☆19Updated 6 months ago
- The semantics of Java in K☆19Updated 3 years ago
- Tree Language Models☆9Updated 7 years ago
- Website for Learning from "Big Code"☆29Updated 3 years ago
- the code for three models introduced in DYNAMIC NEURAL PROGRAM EMBEDDINGS FOR PROGRAM REPAIR (ICLR 18)☆32Updated 6 years ago
- Code for paper "Lancer: Your Code Tell Me What You Need"☆11Updated 2 years ago
- ☆50Updated 4 years ago
- Flow graphs for Python☆25Updated 2 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024☆12Updated 7 months ago
- A toolkit for pre-processing large source code corpora☆46Updated 2 years ago
- ☆16Updated 4 months ago