CouncilDataProject / cdp-scrapersLinks
Scratchpad for scraper development and general utilities.
☆25Updated 11 months ago
Alternatives and similar repositories for cdp-scrapers
Users that are interested in cdp-scrapers are comparing it to the libraries listed below
Sorting:
- ☆15Updated last year
- Data storage utilities and processing pipelines used by CDP instances.☆22Updated 7 months ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Updated last year
- How can we improve name matching in screening tools?☆12Updated 5 months ago
- a python parser for the .fec file format☆46Updated 2 months ago
- Easily download U.S. census maps☆33Updated 2 years ago
- Provide partial dates and retain the date precision through processing☆13Updated 2 years ago
- Tools for downloading agendas, minutes and other documents produced by local government☆53Updated last month
- ☆11Updated 4 months ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆24Updated 4 years ago
- ☆21Updated last week
- An extremely fast FEC filing parser written in C☆76Updated 2 months ago
- A general purpose tool for text-based crosswalking☆107Updated last year
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying☆13Updated 2 years ago
- semantic search for your spreadsheets☆38Updated this week
- ☆15Updated 2 months ago
- Scrapes municipal data from Legistar websites☆43Updated last week
- Platform for journalists to search, analyse, categorise and share unstructured data☆55Updated 2 weeks ago
- Data management service that brings continuous data validation to tabular data in your repository via Github Action☆41Updated last year
- 🔎 Finds fuzzy matches between datasets☆13Updated last month
- Demonstration project for building out a data news rig.☆10Updated 3 years ago
- A tutorial on optical character recognition using tesseract, ImageMagick and other open source tools☆69Updated 5 months ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated 2 weeks ago
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources☆212Updated this week
- DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org☆40Updated this week
- The open-source web scrapers that feed the Los Angeles Times California coronavirus tracker.☆59Updated 2 months ago
- An SQL loader for datasets published via Socrata☆29Updated 2 years ago
- ProPublica's collaborative tip-gathering framework. Import and manage CSV, Google Sheets and Screendoor data with ease.☆100Updated 2 years ago
- A simple Python wrapper for U.S. Census Geocoding Services API batch service☆42Updated 7 months ago
- Docker Container for a Make-based, PDF extraction using OCR☆12Updated 11 months ago