overview / docs2csv
Scan a folder of document files of all types and extract the text into a CSV suitable for Overview
☆26Updated 9 years ago
Alternatives and similar repositories for docs2csv:
Users that are interested in docs2csv are comparing it to the libraries listed below
- ☆23Updated 10 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 9 years ago
- Monitor datasets, gets alerts when something happens☆210Updated 6 years ago
- Ask questions about government data.☆37Updated 6 years ago
- Uses NLP methods to parse and classify contracts from The City of New Orleans☆10Updated 10 years ago
- Code for Newslynx App☆22Updated 9 years ago
- Machine assisted dossiers☆19Updated 7 years ago
- A library and command-line tool for fetching Facebook Pages' published posts.☆13Updated 7 years ago
- A system to track lawmakers and legislation.☆9Updated 3 years ago
- [DEPRECATED] Please use https://goodtables.io☆13Updated 8 years ago
- Tools and lessons plans☆20Updated 8 years ago
- Command-line tool for exploring the PAC donor-recipient relationship☆55Updated 10 years ago
- ☆14Updated 9 years ago
- Archive of political ad data from the Federal Communications Commission☆20Updated 7 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Updated 7 years ago
- Breve☆28Updated 5 years ago
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Updated 2 years ago
- A collection of introductions to various datasets, giving journalists some friendly background before they start doing analysis. Like "Hi…☆71Updated 10 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆24Updated 4 years ago
- Open source tool to help journalists easily mash up data based on shared geography.☆59Updated 9 years ago
- An online reference for data journalism☆25Updated 11 years ago
- ☆23Updated 9 years ago
- A handbook of best practices and case studies for modern collaborative journalism☆13Updated 7 years ago
- For watching a set of URLs and notifying someone when something has changed.☆32Updated 7 years ago
- Tools for working with Optical Character Recognition output☆16Updated 11 years ago
- DAta Nudged into GIT - File-based datasets that use git for version control of individual records☆11Updated 3 years ago
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12Updated last year
- Encryption for Journalists - Hacks/Hackers NYC☆40Updated 11 years ago
- Turns legal citations in the DOM into links☆20Updated 8 years ago
- This semester we will work together to gather, analyze and visualize numbers you need to understand your audience and to tell interactive…☆16Updated 6 years ago