fhoffa / analyzing_githubLinks
Analyzing GitHub with BigQuery and other tools
☆197Updated 5 years ago
Alternatives and similar repositories for analyzing_github
Users that are interested in analyzing_github are comparing it to the libraries listed below
Sorting:
- The GHtorrent project website☆157Updated last year
- Scripts to mirror Github in a cloudy fashion☆567Updated last year
- A Singer tap for extracting data from the GitHub API☆74Updated this week
- Advanced similarity and duplicate source code at scale.☆56Updated 6 years ago
- 💌 A tool to get email addresses by action types such as `starred`, `watching` or `fork` on GitHub repositories; Sending email content to…☆90Updated 4 years ago
- A scraper focused on organizational Github accounts and their members.☆42Updated 3 years ago
- Experiments to help discussion on Wikipedia talk pages☆67Updated 2 weeks ago
- Ontology dataset for open_numbers namespace☆10Updated 10 months ago
- Crawl GitHub APIs and store the discovered orgs, repos, commits, ...☆390Updated 5 years ago
- AboutCode Toolkit provides a simple way to document provenance metadata (origin and license) about third-party code that you use in your…☆98Updated 3 months ago
- Train a model, and detect gibberish strings with it.☆66Updated 3 years ago
- The code processes URLs in an attempt to consolidate different web addresses that point to the same URL and to remove potentially private…☆23Updated 3 years ago
- Perspectives on Data Science for Software Engineering☆61Updated 2 years ago
- Scraping Assisted by Learning☆35Updated last week
- BroadbandNow is the most comprehensive resource for internet service provider plan, pricing and coverage data.☆29Updated 4 years ago
- ☆21Updated 4 years ago
- Open Source Programs (OSPO) Survey☆75Updated last month
- Clean personally identifiable information from dirty dirty text.☆415Updated 2 years ago
- A curated list of awesome SE bots☆58Updated 2 years ago
- source{d} datasets ("big code") for source code analysis and machine learning on source code☆336Updated 5 years ago
- Quickly compare changes made to Jupyter notebooks in GitHub repositories with jupydiff!☆13Updated 2 years ago
- The Data Cards Playbook helps dataset producers and publishers adopt a people-centered approach to transparency in dataset documentation.☆191Updated last year
- An API to get the dependents for any Github repository☆15Updated 5 years ago
- ☆16Updated last year
- A Singer (https://singer.io) target that writes data to Google BigQuery.☆39Updated 4 years ago
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Updated this week
- ☆30Updated 11 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆36Updated 3 years ago
- A Github API client to extract events and actions, and load into a database☆28Updated 3 years ago
- Tracking the history of the FARA data from https://www.justice.gov/nsd-fara☆14Updated 2 years ago