fhoffa / analyzing_githubLinks
Analyzing GitHub with BigQuery and other tools
☆193Updated 5 years ago
Alternatives and similar repositories for analyzing_github
Users that are interested in analyzing_github are comparing it to the libraries listed below
Sorting:
- The GHtorrent project website☆155Updated last year
- Advanced similarity and duplicate source code at scale.☆55Updated 6 years ago
- Online service for analyzing research profiles of scientists and conferences☆13Updated 2 years ago
- Where I keep my Python notes for starting projects☆9Updated 2 years ago
- A tutorial on how to do GitHub research with GHTorrent http://ghtorrent.github.io/tutorial☆21Updated last year
- An analysis of all 1.3 million public Jupyter Notebooks on Github in July 2017☆73Updated 7 years ago
- The code processes URLs in an attempt to consolidate different web addresses that point to the same URL and to remove potentially private…☆23Updated 3 years ago
- Predict code bug risk with git metadata☆42Updated 5 years ago
- A maximum-strength name parser for record linkage.☆37Updated last month
- Scraping Assisted by Learning☆35Updated 2 months ago
- Running Python Code in BigQuery UDFs☆24Updated 4 years ago
- Machine learning model to recommend related content☆19Updated last year
- plait.py - a fake data modeler☆435Updated 6 years ago
- Scripts to mirror Github in a cloudy fashion☆566Updated last year
- Train a model, and detect gibberish strings with it.☆64Updated 3 years ago
- Perspectives on Data Science for Software Engineering☆61Updated 2 years ago
- A scraper focused on organizational Github accounts and their members.☆42Updated 2 years ago
- The Art of Data Science☆36Updated 6 years ago
- A Foursquare data scraper that gathers all venues within a specified geographic area.☆39Updated 6 years ago
- Techniques for Scraping the Web in Python☆25Updated 7 years ago
- Quickly compare changes made to Jupyter notebooks in GitHub repositories with jupydiff!☆13Updated 2 years ago
- Snippets of code used in blog posts and other media.☆13Updated this week
- Ontology dataset for open_numbers namespace☆10Updated 8 months ago
- Utilities used by the Deep Program Understanding team☆102Updated 2 years ago
- ICSE 2021 Artifact for: Shipwright: A Human-in-the-Loop System for Dockerfile Repair.☆22Updated 4 years ago
- Code and data belonging to our CSCW 2019 paper: "Dark Patterns at Scale: Findings from a Crawl of 11K Shopping Websites".☆131Updated 5 years ago
- A Singer tap for extracting data from the GitHub API☆74Updated 2 weeks ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆33Updated 3 years ago
- This repository contains a script used to get the GitHub profile information of all the people who've Stared a given GitHub repository☆68Updated 6 years ago
- Advanced similarity and duplicate source code proof of concept for our research efforts.☆52Updated 2 years ago