jplusplus / statscraper
A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.
☆13 · Updated last year
Alternatives and similar repositories for statscraper:
Users who are interested in statscraper compare it to the libraries listed below.
- API client for Aleph, supports bulk entity and document upload. ☆28 · Updated 4 months ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google" ☆34 · Updated 4 years ago
- Scraper for Facebook, Gab, Google and TikTok ☆22 · Updated 7 months ago
- Service for creating Twitter datasets for research and archiving. ☆26 · Updated 2 years ago
- ☆12 · Updated 5 years ago
- Tools for tracking stories on news homepages ☆48 · Updated 5 years ago
- 📒 Analyzing Data, the DataMade Way ☆37 · Updated 3 years ago
- An alpha project combining beneficial ownership and contracting data ☆13 · Updated 3 years ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists. ☆17 · Updated 2 weeks ago
- DEPRECATED. Desktop graph visualization application ☆50 · Updated 2 years ago
- A financial disclosure data extraction tool. ☆13 · Updated last year
- Machine assisted dossiers ☆19 · Updated 7 years ago
- A gathering of digital methods recipes for research, teaching and collaborations from across the Public Data Lab. ☆11 · Updated 11 months ago
- How Quartz used AI to help reporters search the Mauritius Leaks ☆46 · Updated 5 years ago
- How hard is it to get a list of all local news sites in the United States (LOL) ☆8 · Updated 4 years ago
- Tracking the history of the FARA data from https://www.justice.gov/nsd-fara ☆14 · Updated last year
- Parse Popolo JSON data and navigate it with Python ☆15 · Updated 5 years ago
- Mecodify tool for Twitter data analysis and visualisation ☆42 · Updated last year
- Monitor datasets, get alerts when something happens ☆210 · Updated 6 years ago
- Extract networks of entities from journalistic reporting ☆48 · Updated last year
- Research-grade URL expansion for Python. ☆26 · Updated 6 years ago
- A general list of resources and articles for people interested in getting into data journalism ☆16 · Updated last year
- Frontend interface for Datashare, a self-hosted search engine for documents. ☆34 · Updated this week
- Ask questions about government data. ☆37 · Updated 6 years ago
- Searching large heterogeneous data dumps with Universal Sentence Encoder ☆62 · Updated 3 years ago
- A library and command-line tool for fetching Facebook Pages' published posts. ☆13 · Updated 7 years ago
- Sidewall is a Python library for interacting with the Dimensions search API. ☆17 · Updated 5 months ago
- A LevelDB backed URL unshortening microservice written in JavaScript ☆31 · Updated 2 years ago
- A git scraper recording the CDC's Covid Data Tracker numbers on number of vaccinations per state. ☆24 · Updated last year
- 📕 Writing tests, the DataMade way ☆16 · Updated 4 years ago