fhoffa / analyzing_github
Analyzing GitHub with BigQuery and other tools
☆199 Updated 5 years ago
Alternatives and similar repositories for analyzing_github
Users who are interested in analyzing_github are comparing it to the libraries listed below
- Advanced similarity and duplicate source code at scale. ☆56 Updated 6 years ago
- The GHtorrent project website ☆158 Updated last year
- Scripts to mirror GitHub in a cloudy fashion ☆568 Updated last year
- An analysis of all 1.3 million public Jupyter Notebooks on GitHub in July 2017 ☆72 Updated 7 years ago
- A Singer tap for extracting data from the GitHub API ☆75 Updated last week
- Train a model, and detect gibberish strings with it. ☆67 Updated 3 years ago
- A command line tool to cluster HTML pages based on structural and style similarity. ☆20 Updated 4 months ago
- A scraper focused on organizational GitHub accounts and their members. ☆42 Updated last month
- Donations list website (DLW): a repository for keeping track of public donations by some people I (arbitrarily) decide to track ☆20 Updated last month
- Scraping Assisted by Learning ☆36 Updated 2 months ago
- Predict code bug risk with git metadata ☆42 Updated 6 years ago
- BroadbandNow is the most comprehensive resource for internet service provider plan, pricing and coverage data. ☆29 Updated 4 years ago
- Calculate the score of a repository based on best engineering practices. ☆113 Updated 5 years ago
- A GitHub API client to extract events and actions, and load into a database ☆28 Updated 4 years ago
- Track changes to GraphQL APIs by git scraping their schemas ☆30 Updated 8 months ago
- ☆11 Updated 2 years ago
- ☆13 Updated 3 years ago
- dbt data models for Facebook Ads ☆41 Updated last year
- Code and data belonging to our CSCW 2019 paper: "Dark Patterns at Scale: Findings from a Crawl of 11K Shopping Websites". ☆135 Updated 6 years ago
- Scrape messages from Slack channels ☆31 Updated 6 years ago
- Repo for the Stitch Docs ☆58 Updated this week
- CLK hash: hash PII for entity matching ☆47 Updated 7 months ago
- Drag N Drop WebApp to Build and Manage Airflow DAGs ☆25 Updated 2 years ago
- 💨🥫 A Data Factory system for running data processing pipelines built on AirFlow and tailored to CKAN. Includes evolution of DataPusher … ☆33 Updated 4 months ago
- Get statistics on web traffic to your GitHub repositories. ☆128 Updated 2 years ago
- source{d} datasets ("big code") for source code analysis and machine learning on source code ☆341 Updated 6 years ago
- Scraping Tweet data for Russian Troll Twitter accounts into Neo4j ☆57 Updated 7 years ago
- Running Python Code in BigQuery UDFs ☆24 Updated 5 years ago
- Venmo transaction dataset for data analysis/visualization/anything ☆210 Updated 5 years ago
- Generating Realistic Synthetic Data ☆41 Updated last year