jplusplus / statscraperLinks
A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.
☆13Updated 3 months ago
Alternatives and similar repositories for statscraper
Users that are interested in statscraper are comparing it to the libraries listed below
Sorting:
- scraper for facebook, gab, google and tiktok☆21Updated 2 weeks ago
- A financial disclosure data extraction tool.☆16Updated last year
- API client for Aleph, supports bulk entity and document upload.☆28Updated 7 months ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 3 years ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago
- Service for creating Twitter datasets for research and archiving.☆26Updated 2 years ago
- ☆11Updated 6 years ago
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 5 years ago
- Tools for tracking stories on news homepages☆48Updated 5 years ago
- A Python library for defining rule-based overrides on messy data☆14Updated last month
- A tool to allow US addresses to be geocoded/georeferenced easily, without using Python or the command line or paid services or anything.☆18Updated 2 years ago
- Sidewall is a Python library for interacting with the Dimensions search API.☆17Updated 8 months ago
- Scraping Assisted by Learning☆35Updated 2 weeks ago
- Deduplicate and parse list of `dirty names'☆23Updated 4 years ago
- The core of sunlightlabs' Data Commons project. Includes the Transparency Data site and the APIs that power TransparencyData.com and Infl…☆38Updated 8 years ago
- searching large heterogenous data dumps with Universal Sentence Encoder☆62Updated 4 years ago
- A database of public bodies such as government departments, ministries etc.☆68Updated 4 months ago
- Research-grade URL expansion for Python.☆27Updated 7 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance☆12Updated 8 years ago
- IWAAN - An interactive Jupyter Notebook collection that allows to run analyses of Wikipedia article editing dynamics out-of-the-box on Bi…☆9Updated last year
- Uses NLP methods to parse and classify contracts from The City of New Orleans☆10Updated 10 years ago
- Ask questions about government data.☆37Updated 6 years ago
- Actor Network Text Analyser☆56Updated 10 years ago
- Frontend interface for Datashare, a self-hosted search engine for documents.☆35Updated this week
- Basic cookiecutter template for Python projects☆21Updated 8 months ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆24Updated 4 years ago
- A collection of interesting software, processes, and methodologies built and used across Public Media.☆17Updated 5 years ago
- 📒 Analyzing Data, the DataMade Way☆37Updated 4 years ago
- A helper library full of URL-related heuristics.☆69Updated 2 months ago