fhoffa / analyzing_githubLinks
Analyzing GitHub with BigQuery and other tools
☆199Updated 5 years ago
Alternatives and similar repositories for analyzing_github
Users that are interested in analyzing_github are comparing it to the libraries listed below
Sorting:
- An analysis of all 1.3 million public Jupyter Notebooks on Github in July 2017☆72Updated 7 years ago
- 💌 A tool to get email addresses by action types such as `starred`, `watching` or `fork` on GitHub repositories; Sending email content to…☆90Updated 4 years ago
- Common Crawl Index Server☆71Updated 10 months ago
- A scraper focused on organizational Github accounts and their members.☆42Updated 2 months ago
- The code processes URLs in an attempt to consolidate different web addresses that point to the same URL and to remove potentially private…☆23Updated 4 years ago
- sync a website or local spreadsheet with a google sheet☆35Updated 3 years ago
- Train a model, and detect gibberish strings with it.☆67Updated 3 years ago
- 💨🥫 A Data Factory system for running data processing pipelines built on AirFlow and tailored to CKAN. Includes evolution of DataPusher …☆33Updated 5 months ago
- Code and data belonging to our CSCW 2019 paper: "Dark Patterns at Scale: Findings from a Crawl of 11K Shopping Websites".☆136Updated 6 years ago
- The Data Explorer is nteract's automatic visualization tool.☆107Updated 3 years ago
- ☆77Updated 2 years ago
- Ontology dataset for open_numbers namespace☆10Updated last week
- Track changes to GraphQL APIs by git scraping their schemas☆31Updated 9 months ago
- Open Source Programs (OSPO) Survey☆75Updated 4 months ago
- A tutorial on how to do GitHub research with GHTorrent http://ghtorrent.github.io/tutorial☆22Updated last year
- BroadbandNow is the most comprehensive resource for internet service provider plan, pricing and coverage data.☆29Updated 4 years ago
- Datasette plugin for publishing data using Vercel☆46Updated 3 years ago
- Scripts to mirror Github in a cloudy fashion☆567Updated last year
- Perspectives on Data Science for Software Engineering☆61Updated 2 years ago
- A maximum-strength name parser for record linkage.☆39Updated 4 months ago
- ☆31Updated 11 years ago
- Stream Twitter Data into BigQuery with Cloud Dataprep☆22Updated last week
- A command line tool to cluster html pages based on structural and style similarity.☆20Updated last month
- Send Sir Perceval on a quest to retrieve and gather data from software repositories.☆312Updated 3 weeks ago
- Automated data scraper: points to unstructured public data sets to create a Digital Development Exchange for open data, mapped to the Dig…☆34Updated this week
- A visual analysis tool for exploring multiverse outcomes☆33Updated 3 years ago
- Project OCEAN is an open science collaboration focused on understanding the open source ecosystems creating datasets that enable research…☆55Updated 9 months ago
- Clean personally identifiable information from dirty dirty text.☆416Updated 2 years ago
- Scraping Assisted by Learning☆36Updated 3 months ago
- Tool that tries to guess a person's gender based on their name and location☆93Updated last year