pudo-attic / extractors
Re-usable wrapper scripts for text document extractors.
☆37Updated 8 years ago
Alternatives and similar repositories for extractors:
Users that are interested in extractors are comparing it to the libraries listed below
- LoadKit supports Extract, Transform, Load processes based on ArchiveKit buckets.☆11Updated 9 years ago
- A repo of class materials for NICAR16☆11Updated 9 years ago
- An icon font of New York City boroughs.☆14Updated 10 years ago
- For watching a set of URLs and notifying someone when something has changed.☆31Updated 7 years ago
- A tool to allow US addresses to be geocoded/georeferenced easily, without using Python or the command line or paid services or anything.☆17Updated 2 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 9 years ago
- pneumatic is a bulk-upload library for DocumentCloud.☆23Updated 4 years ago
- Data and analysis supporting several passages in the BuzzFeed News article, "The New American Slavery: Invited To The U.S., Foreign Worke…☆28Updated 8 years ago
- ☆23Updated 10 years ago
- Chrome extension that highlights anonymous sources in news articles☆33Updated 8 years ago
- FOIL resources for New York City and New York State☆19Updated 9 years ago
- ☆14Updated 9 years ago
- A collection of introductions to various datasets, giving journalists some friendly background before they start doing analysis. Like "Hi…☆71Updated 10 years ago
- A simple script to look for and process all the federal data.json data inventories.☆46Updated 10 years ago
- Monitor datasets, gets alerts when something happens☆210Updated 6 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 9 years ago
- Parse live video and extract Chyron text☆20Updated 7 years ago
- Square tilemap of Switzerland, as GeoJSON, SVG, and Shapefile. Please feel free to use. The only thing we ask for is to credit like so: "…☆19Updated 3 years ago
- Archive of political ad data from the Federal Communications Commission☆20Updated 7 years ago
- Geocode CSVs and jitter overlapping points☆22Updated 8 years ago
- Open source tool to help journalists easily mash up data based on shared geography.☆59Updated 9 years ago
- A system to track lawmakers and legislation.☆9Updated 3 years ago
- A handbook of best practices and case studies for modern collaborative journalism☆13Updated 7 years ago
- Tracking my progress in doing GIS/Geospatial work in Python 3.x☆12Updated 8 years ago
- The slides and code from my NICAR talk.☆41Updated 8 years ago
- ☆36Updated 7 years ago
- Whippersnapper is an automated screenshot tool to keep a visual history of content on the web.☆55Updated 9 years ago
- Politwoops web front end☆44Updated 7 years ago
- “Let Me Get That Data For You” catalogs the machine-readable data on a given domain name. [RETIRED]☆102Updated 9 years ago
- The TK Toolkit. Utilities for working with data in Node.js.☆15Updated 9 years ago