pudo-attic / extractorsLinks
Re-usable wrapper scripts for text document extractors.
☆37Updated 9 years ago
Alternatives and similar repositories for extractors
Users that are interested in extractors are comparing it to the libraries listed below
Sorting:
- A repository of journalist's lookup tables.☆107Updated 8 years ago
- An icon font of New York City boroughs.☆14Updated 10 years ago
- Whippersnapper is an automated screenshot tool to keep a visual history of content on the web.☆55Updated 9 years ago
- A simple script to look for and process all the federal data.json data inventories.☆46Updated 10 years ago
- Open source tool to help journalists easily mash up data based on shared geography.☆59Updated 10 years ago
- Command-line tool for exploring the PAC donor-recipient relationship☆55Updated 10 years ago
- For watching a set of URLs and notifying someone when something has changed.☆32Updated 8 years ago
- Tables is a simple command-line tool and powerful library for importing data like a CSV or JSON file into relational tables☆88Updated 3 years ago
- NPR Visual's Carebot (deprecated, now in: https://github.com/thecarebot/carebot)☆15Updated 10 years ago
- A collection of introductions to various datasets, giving journalists some friendly background before they start doing analysis. Like "Hi…☆71Updated 11 years ago
- ☆23Updated 10 years ago
- The slides and code from my NICAR talk.☆41Updated 8 years ago
- ☆15Updated 10 years ago
- pneumatic is a bulk-upload library for DocumentCloud.☆22Updated 5 years ago
- ☆24Updated 9 years ago
- “Let Me Get That Data For You” catalogs the machine-readable data on a given domain name. [RETIRED]☆102Updated 10 years ago
- Geocode CSVs and jitter overlapping points☆22Updated 8 years ago
- This semester we will work together to gather, analyze and visualize numbers you need to understand your audience and to tell interactive…☆17Updated 7 years ago
- A Python wrapper for the OpenFEC API.☆28Updated 6 years ago
- Tools and lessons plans☆20Updated 8 years ago
- Archive of political ad data from the Federal Communications Commission☆20Updated 8 years ago
- A how-to do a mass collection of FEC data using the command-line and regular expressions☆29Updated 9 years ago
- Parse live video and extract Chyron text☆20Updated 8 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- Machine assisted dossiers☆19Updated 8 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 10 years ago
- A Los Angeles Times analysis of serious assaults misclassified by LAPD☆63Updated 7 years ago
- Tutorial on visualizing data with matplotlib and pandas for NICAR16☆38Updated 9 years ago
- Code for Newslynx App☆22Updated 10 years ago
- Loose Miscellany☆21Updated 8 years ago