opensecrets / OCRToolkit
Tools for working with Optical Character Recognition output
☆16 Updated 10 years ago
Alternatives and similar repositories for OCRToolkit:
Users who are interested in OCRToolkit compare it to the libraries listed below.
- Machine assisted dossiers ☆19 Updated 7 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents ☆15 Updated 7 years ago
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website. ☆11 Updated 2 years ago
- JSON schemas for OpenCorporates data ☆19 Updated 8 months ago
- Responsively embed DocumentCloud pages. ☆22 Updated 6 years ago
- Whippersnapper is an automated screenshot tool to keep a visual history of content on the web. ☆55 Updated 8 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs ☆17 Updated 7 years ago
- Data storytelling. See link for detailed documentation: http://lab41.github.io/gestalt. ☆20 Updated 8 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3. ☆15 Updated 9 years ago
- Archive of political ad data from the Federal Communications Commission ☆20 Updated 7 years ago
- LoadKit supports Extract, Transform, Load processes based on ArchiveKit buckets. ☆11 Updated 9 years ago
- A platform for tools that do stuff with data ☆56 Updated 5 years ago
- A Node.js wrapper around the DocumentCloud API. ☆12 Updated 7 years ago
- Structured Data from PDF image-based files ☆87 Updated 11 years ago
- Command line utility for d3-pre pre-rendering pipeline ☆13 Updated 8 years ago
- Where things come from in Who's On First. ☆21 Updated 10 months ago
- JavaScript implementation of nmap (Neighborhood Preservation Space-filling Algorithm) ☆18 Updated 9 years ago
- Monitor datasets, get alerts when something happens ☆210 Updated 6 years ago
- Common UI Library that powers Polestar and Voyager ☆13 Updated 8 years ago
- Diving into the data behind signs on Illinois highways that say "957 TRAFFIC DEATHS IN 2012." #peoplenotdata ☆16 Updated 3 years ago
- Data Quality Dashboards display statistics on a collection of published data. ☆33 Updated 4 years ago
- An explorable budget visualization for New York state ☆28 Updated last week
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+ ☆47 Updated last year
- CKAN Resource View to build maps and choropleth maps ☆26 Updated last year
- Simple JSON API for small crowdsourcing apps ☆13 Updated 7 years ago
- Investigative tool for extracting relevant areas from many documents ☆14 Updated 9 years ago
- Process US Census data into a seamless dataset. ☆16 Updated 3 weeks ago
- CSV grooming, the JS way ☆21 Updated 5 years ago
- The news homepage archive ☆81 Updated 3 years ago
- A small repo of notes and scripts for collecting data on U.S. deadly force police incidents ☆10 Updated 9 years ago