fhoffa / analyzing_github
Analyzing GitHub with BigQuery and other tools
☆187Updated 4 years ago
Related projects: ⓘ
- Scripts to mirror Github in a cloudy fashion☆559Updated 5 months ago
- Tools used to create the data on TravisTorrent (http://travistorrent.testroots.org).☆42Updated last year
- The GHtorrent project website☆147Updated 2 months ago
- Homepage for 17-803 "Empirical Methods" at Carnegie Mellon University☆123Updated 6 months ago
- The Data Linter identifies potential issues (lints) in your ML training data.☆87Updated 6 years ago
- Supplementary material for MSR2017 paper Structure and Evolution of Package Dependency Networks☆18Updated 5 years ago
- Advanced similarity and duplicate source code at scale.☆54Updated 5 years ago
- Collect information about dependencies between a github repo and other repositories. Results available in JSON, markdown and badge☆104Updated this week
- Calculate the score of a repository based on best engineering practices.☆106Updated 3 years ago
- Perspectives on Data Science for Software Engineering☆59Updated last year
- Python wrapper for libraries.io API☆16Updated 2 weeks ago
- ☆30Updated this week
- ☆13Updated 3 years ago
- ☆21Updated 3 years ago
- Scalpel: The Python Static Analysis Framework☆285Updated 5 months ago
- ☆16Updated 4 years ago
- Code associated with a research project for experimenting with different ways of surfacing stylistic, analytic, or visual discrepancies i…☆14Updated 3 weeks ago
- ☆12Updated last year
- Finding similar repositories on GitHub☆45Updated last year
- Neural bag of words code search implementation using PyTorch and data from the CodeSearchNet project.☆69Updated last year
- evaluation dataset consisting of natural language query and code snippet pairs☆123Updated 4 months ago
- Assessment of the pull based development model, as implemented by Github☆73Updated 5 years ago
- Babelfish Python client☆16Updated 4 years ago
- A command line tool to cluster html pages based on structural and style similarity.☆19Updated 2 months ago
- ☆34Updated 3 years ago
- Crawl GitHub APIs and store the discovered orgs, repos, commits, ...☆373Updated 4 years ago
- Working Group focused on Evolution metrics (for software development projects)☆39Updated 10 months ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆288Updated last month
- Stream Twitter Data into BigQuery with Cloud Dataprep☆22Updated 2 weeks ago
- Snippets of code used in blog posts and other media.☆13Updated last year