dbreunig / git-scraper-extractorLinks
Pull out versions of specific files from a gitscraping repo into individual files.
☆15Updated 4 years ago
Alternatives and similar repositories for git-scraper-extractor
Users that are interested in git-scraper-extractor are comparing it to the libraries listed below
Sorting:
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 5 years ago
- An SQL loader for datasets published via Socrata☆28Updated 3 years ago
- Library of Congress coding standards☆31Updated last year
- An open-source archive that gathers, saves, shares and analyzes news homepages☆151Updated 3 weeks ago
- Easily download U.S. census maps☆34Updated 2 years ago
- Python parser for the Archie Markup Language (ArchieML)☆12Updated 4 years ago
- A build tool by and for the Los Angeles Times☆29Updated 3 months ago
- Scrapers for disaster data - writes to https://github.com/simonw/disaster-data☆50Updated 2 years ago
- Collaborative data collection tool developed by the Associated Press☆109Updated 2 years ago
- Core library for the datakit CLI framework.☆57Updated 3 years ago
- Parses Google Documents formatted for annotated transcripts –– with JavaScript☆18Updated 3 years ago
- A general purpose tool for text-based crosswalking☆109Updated last year
- Twitter, quick. Fetch and store tweets on short notice.☆79Updated 9 years ago
- Add editing UI and other power-user features to Datasette.☆13Updated 2 years ago
- Scrapers for U.S. county court sites.☆74Updated 3 years ago
- ☆17Updated 8 months ago
- Interactive and searchable House staffer directory, based on House disbursement data.☆30Updated last year
- The open-source web scrapers that feed the Los Angeles Times California coronavirus tracker.☆59Updated 8 months ago
- NPR's daily graphics rig, 2.0☆71Updated 3 months ago
- Platform for journalists to search, analyse, categorise and share unstructured data☆56Updated last week
- Machine assisted dossiers☆19Updated 8 years ago
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying☆14Updated 2 years ago
- MuckRock User Service☆11Updated last week
- The build process for EveryCRSReport.com.☆72Updated last year
- A build tool for data projects.☆49Updated last year
- Save My News: A personal, permanent clipping service☆28Updated 2 years ago
- Datasette plugin for modifying table schemas☆19Updated 2 months ago
- A Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCES…☆65Updated last year
- Carles Pina Estany's 2020 Tool Fund: data managers and researchers collaborate to write the Frictionless Data packages, tabular schemas, …☆18Updated 2 years ago
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12Updated 2 years ago