fhoffa / analyzing_github
Analyzing GitHub with BigQuery and other tools
☆193Updated 4 years ago
Alternatives and similar repositories for analyzing_github:
Users that are interested in analyzing_github are comparing it to the libraries listed below
- Supplementary material for MSR2017 paper Structure and Evolution of Package Dependency Networks☆18Updated 6 years ago
- Perspectives on Data Science for Software Engineering☆61Updated 2 years ago
- Tools used to create the data on TravisTorrent (http://travistorrent.testroots.org).☆43Updated 2 years ago
- Scripts to mirror Github in a cloudy fashion☆565Updated 11 months ago
- Calculate the score of a repository based on best engineering practices.☆111Updated 4 years ago
- The GHtorrent project website☆153Updated 8 months ago
- 🪐 A Database of Existing Security Vulnerabilities Patches to Enable Evaluation of Techniques (single-commit; multi-language)☆38Updated 2 years ago
- evaluation dataset consisting of natural language query and code snippet pairs☆123Updated 10 months ago
- ICSE 2021 Artifact for: Shipwright: A Human-in-the-Loop System for Dockerfile Repair.☆22Updated 3 years ago
- Homepage for 17-803 "Empirical Methods" at Carnegie Mellon University☆127Updated last year
- Scraping Assisted by Learning☆35Updated this week
- Paper Artifacts for "Aroma: Code Recommendation via Structural Code Search"☆58Updated 3 years ago
- DataOps for Government☆34Updated 6 years ago
- The code processes URLs in an attempt to consolidate different web addresses that point to the same URL and to remove potentially private…☆23Updated 3 years ago
- ☆16Updated 4 years ago
- Utilities used by the Deep Program Understanding team☆102Updated last year
- Paper reading club at source{d}☆115Updated 5 years ago
- Predict code bug risk with git metadata☆42Updated 5 years ago
- An analysis of all 1.3 million public Jupyter Notebooks on Github in July 2017☆73Updated 7 years ago
- Assessment of the pull based development model, as implemented by Github☆72Updated 6 years ago
- A Python 3 module that provides functions for splitting identifiers found in source code files.☆47Updated 2 years ago
- ☆14Updated 4 years ago
- Send Sir Perceval on a quest to retrieve and gather data from software repositories.☆298Updated last week
- A Systematic Literature Review of Deep Learning in Software Engineering☆19Updated 7 months ago
- Checks the PDFs submitted to a conference, e.g., for formatting violations and double anonymous violations☆61Updated 3 years ago
- Finding similar repositories on GitHub☆48Updated 2 years ago
- code and data for paper "BASHEXPLAINER: Retrieval-Augmented Bash Code Comment Generation based on Fine-tuned CodeBERT", which accepted in…☆12Updated 2 years ago
- Home page of project "KB"☆121Updated this week
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Babelfish Python client☆16Updated 5 years ago