sotorrent / db-scripts
SQL and Bash scripts to import the offical Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references from the BigQuery GitHub data set, and to retrieve data from the SOTorrent dataset for analysis.
☆16Updated 3 months ago
Alternatives and similar repositories for db-scripts:
Users that are interested in db-scripts are comparing it to the libraries listed below
- ICSE 2021 Artifact for: Shipwright: A Human-in-the-Loop System for Dockerfile Repair.☆22Updated 3 years ago
- A Systematic Literature Review of Deep Learning in Software Engineering☆19Updated 4 months ago
- A tool for mining graph-based change patterns in Python code☆19Updated 7 months ago
- A dynamic method for detecting faults in incremental and parallel builds.☆17Updated 2 years ago
- MLonCode community effort to implement Learning Distributed Representations of Code (https://arxiv.org/pdf/1803.09473.pdf)☆40Updated 6 years ago
- PyTorch library for synthesizing programs from natural language☆18Updated 5 months ago
- ICSE'18: Tuning Smote☆11Updated 6 years ago
- Tree-based Autofolding Software Summarization Algorithm☆42Updated 8 years ago
- Flow graphs for Python☆25Updated 2 years ago
- LibSA4Py: Light-weight static analysis for extracting type hints and features☆11Updated last year
- C# Data Extraction for "Learning to Represent Edits"☆26Updated 6 years ago
- Source code and data about our large scale study about Java annotaion in practice☆12Updated last year
- A benchmark for evaluating embeddings of identifiers in source code.☆22Updated 3 years ago
- A program repair tool which modifies any bugged Python script based on cues from rest of program.☆17Updated 3 years ago
- Unit testing for SQL queries☆24Updated 5 months ago
- Language-independent, search-based program repair -- just your cup of tea! ☕☆28Updated 6 months ago
- an implementation of "code2vec: Learning Distributed Representations of Code"☆29Updated 6 months ago
- A python powered normalized compression distance (NCD) calculator.☆13Updated 8 years ago
- ☆10Updated 4 years ago
- ☆10Updated 4 years ago
- The replication package of <Duplicate Bug Report Detection: How Far Are We?>. Accepted by ACM Transactions on Software Engineering and Me…☆10Updated last year
- The semantics of Java in K☆19Updated 3 years ago
- Models and datasets for annotated code search.☆35Updated last year
- A new framework to generate interpretable classification rules☆17Updated last year
- 🤓 user2code2vec: Embeddings for Profiling Students Based on Distributional Representations of Source Code. Full Paper presented at Learn…☆22Updated 5 years ago
- Code for the paper "A Structural Model for Contextual Code Changes"☆28Updated last year
- A set of tools for extracting tokens and ASTs from code☆22Updated 6 years ago
- code and data for paper "BASHEXPLAINER: Retrieval-Augmented Bash Code Comment Generation based on Fine-tuned CodeBERT", which accepted in…☆11Updated 2 years ago
- ☆50Updated 4 years ago
- This repository contains an implementation for design patterns detection. In this task, feature engineering and ensemble learning are app…☆10Updated 2 years ago