jplusplus / statscraper
A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.
☆13Updated 2 months ago
Alternatives and similar repositories for statscraper:
Users that are interested in statscraper are comparing it to the libraries listed below
- API client for Aleph, supports bulk entity and document upload.☆28Updated 6 months ago
- scraper for facebook, gab, google and tiktok☆21Updated 10 months ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 3 years ago
- Service for creating Twitter datasets for research and archiving.☆26Updated 2 years ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago
- A financial disclosure data extraction tool.☆16Updated last year
- Ask questions about government data.☆37Updated 6 years ago
- a general list of resources and articles for people interested in getting into data journalism☆16Updated 2 years ago
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 5 years ago
- 📒 Analyzing Data, the DataMade Way☆37Updated 4 years ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated last month
- The core of sunlightlabs' Data Commons project. Includes the Transparency Data site and the APIs that power TransparencyData.com and Infl…☆38Updated 8 years ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- A maximum-strength name parser for record linkage.☆37Updated this week
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆24Updated 4 years ago
- Parse Popolo JSON data and navigate it with Python☆15Updated 5 years ago
- Interactive and searchable House staffer directory, based on House disbursement data.☆27Updated last year
- Examples for getting started using https://case.law☆65Updated 2 years ago
- Tools for tracking stories on news homepages☆48Updated 5 years ago
- Materials to reproduce findings in our stories, "Swinging the Vote?", and "To Gmail, Most Black Lives Matter Emails Are 'Promotions'"☆38Updated 10 months ago
- Research-grade URL expansion for Python.☆27Updated 6 years ago
- A PDF classifier ensemble with REST API service☆23Updated 4 years ago
- A Python library for defining rule-based overrides on messy data☆13Updated 2 weeks ago
- How Quartz used AI to help reporters search the Mauritius Leaks☆47Updated 5 years ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆23Updated last year
- All of our code examples and tutorials☆66Updated 6 years ago
- A tool to allow US addresses to be geocoded/georeferenced easily, without using Python or the command line or paid services or anything.☆18Updated 2 years ago
- R Shiny App created to predict the success rate of Freedom of Information Act requests.☆16Updated 7 years ago
- ☆11Updated 5 years ago
- All the files and documentation necessary to reuse, remix and translate A Field Guide to "Fake News" and Other Information Disorders.☆61Updated 4 years ago