miroli / swe-scrapers
A collection of web scrapers and API wrappers for Swedish sites
☆21Updated 8 years ago
Related projects:
- A Python package for downloading data from the UK Parliament's Data Platform.☆27Updated 4 years ago
- Swedish parliamentary proceedings - Riksdagens protokoll 1867-today☆26Updated 4 months ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆69Updated 2 weeks ago
- Swedish language resources: female and male names, localities, counties, municipalities, countries, nationalities, occupations, sentiment lexicon, morality, stop words, mynd…☆74Updated last year
- Fraud detection related data and scripts to share with partners.☆18Updated last year
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak…☆67Updated 2 years ago
- Web application for disseminating statistical tables☆33Updated 3 weeks ago
- OCCRP and media partners collected data on COVID-19 related spending from across Europe from February to October 2020☆13Updated 3 years ago
- R package for poll retrieval from pollofpolls.eu and poll aggregation (trends) calculations☆18Updated 5 years ago
- Python library for interacting with smapp collections☆18Updated 8 years ago
- ☆37Updated this week
- ☆37Updated 5 years ago
- This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every …☆29Updated last year
- Extract networks of entities from journalistic reporting☆46Updated last year
- Introduction to data journalism☆14Updated 5 years ago
- Workbook to teach the concept of risk ratios for data journalism applications☆32Updated 2 years ago
- Analysis and code to accompany The Companies We Keep briefing on the state of the UK's register of Persons of Significant Control.☆19Updated 6 years ago
- From free-form text to standardized geographical info.☆9Updated 5 years ago
- [WIP] Liquidoc is an application to facilitate the creation of adaptive text.☆10Updated 9 years ago
- Tools and lesson plans☆20Updated 7 years ago
- Python library for reading JSON-stat format datasets☆22Updated 3 years ago
- Tools for downloading from the LexisNexis API☆17Updated 8 years ago
- ☆36Updated 2 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets☆13Updated 4 years ago
- Research-grade URL expansion for Python.☆25Updated 6 years ago
- A Python Wrapper To Retrieve Data From The CrowdTangle API☆11Updated 3 months ago
- How Quartz used AI to help reporters search the Mauritius Leaks☆44Updated 5 years ago
- Rank Degree Influencer Core Sampler (RaDICeS). A Twitter follow network crawler for influential accounts using the cost-free Twitter API.☆13Updated last year
- A bunch of scripts for investigating Reddit☆11Updated 7 years ago
- Machine-learning Protest Event Data System☆35Updated last month