jplusplus / statscraper
A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.
☆13Updated last year
Alternatives and similar repositories for statscraper:
Users that are interested in statscraper are comparing it to the libraries listed below
- API client for Aleph, supports bulk entity and document upload.☆28Updated 3 months ago
- A financial disclosure data extraction tool.☆13Updated last year
- ☆12Updated 5 years ago
- Extract networks of entities from journalistic reporting☆47Updated last year
- scraper for facebook, gab, google and tiktok☆22Updated 6 months ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago
- 📒 Analyzing Data, the DataMade Way☆37Updated 3 years ago
- DEPRECATED. Desktop graph visualization application☆50Updated 2 years ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.☆78Updated last year
- The core of sunlightlabs' Data Commons project. Includes the Transparency Data site and the APIs that power TransparencyData.com and Infl…☆38Updated 8 years ago
- Ask questions about government data.☆37Updated 6 years ago
- Tools for tracking stories on news homepages☆48Updated 5 years ago
- Service for creating Twitter datasets for research and archiving.☆26Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- A Python library for defining rule-based overrides on messy data☆13Updated last month
- Docker Container for a Make-based, PDF extraction using OCR☆12Updated 5 months ago
- Research-grade URL expansion for Python.☆26Updated 6 years ago
- Scraping Assisted by Learning☆35Updated this week
- Datakit plugin to help manage Github integration on data projects.☆12Updated 2 years ago
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence☆62Updated last year
- A maximum-strength name parser for record linkage.☆36Updated 5 months ago
- Source code that reproduces the results from the paper "Who Let The Trolls Out? Towards Understanding State-Sponsored Trolls" (https://ar…☆20Updated 5 years ago
- An ultra-simple example of how to use Python to write stories based on a set of data.☆29Updated 11 years ago
- Python utilities to make it a little easier to set up and run a Twitter bot☆40Updated last year
- R Shiny App created to predict the success rate of Freedom of Information Act requests.☆16Updated 7 years ago
- a general list of resources and articles for people interested in getting into data journalism☆16Updated last year
- Mecodify tool for twitter data analysis and visualisation☆42Updated last year
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak…☆67Updated 3 years ago
- Examples for getting started using https://case.law☆65Updated 2 years ago