raleighpublicrecord / dochiveLinks
Structured Data from PDF image-based files
☆88Updated 12 years ago
Alternatives and similar repositories for dochive
Users that are interested in dochive are comparing it to the libraries listed below
Sorting:
- A platform for tools that do stuff with data☆56Updated 6 years ago
- Docker container to provide Apache Tika RESTful API☆41Updated 9 years ago
- Discover, analyze and present data from the web and mobile in meaninful ways☆82Updated 12 years ago
- View, visualize, clean and process data in the browser.☆147Updated 7 years ago
- Data Pipes for CSV☆116Updated 2 years ago
- Tools for working with Optical Character Recognition output☆16Updated 11 years ago
- A place to collect and share knowledge about liberating data from PDFs☆54Updated 3 years ago
- Detective.io is a platform that hosts your investigation and lets you make powerful queries to mine it. Simply describe your field of stu…☆136Updated 9 years ago
- Data Quality Dashboards display statistics on a collection of published data.☆33Updated 5 years ago
- Tools for generating portable data portals☆58Updated 2 years ago
- Guides and introductions for participating in Labs and some of its projects.☆170Updated 8 years ago
- Collaborative Innovation Class Project☆14Updated 10 years ago
- Make for data☆20Updated 6 years ago
- (DEPRECATED) Parser for U.S. federal regulations and other regulatory information☆55Updated 7 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- Easily crowdsource the analysis of your documents☆102Updated 7 years ago
- Create and validate Data Packages in the browser☆27Updated 3 years ago
- Import GeoNames.org data into a SQLite database for full-text search and autocomplete☆35Updated 6 years ago
- Python library and command line tool for converting data from one format to another☆99Updated 5 years ago
- A Relaxed Schema Graph Database Management System☆53Updated 5 years ago
- Global Data Journalists Directory☆10Updated 6 years ago
- Exploring extracting tables from a PDF to CSV using PDF.JS☆105Updated 8 years ago
- Neddick: Open Source Information Discovery Platform☆36Updated 2 years ago
- A queue-controlled browser automation tool for improving web crawl quality☆61Updated 4 months ago
- Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.☆84Updated 9 years ago
- This a module to extract RDF from an HTML5 page annotated with microdata. The module implements the algorithm defined and published by th…☆44Updated 3 years ago
- Monitor datasets, gets alerts when something happens☆210Updated 6 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs☆17Updated 7 years ago
- A pastebin for tables.☆34Updated 11 years ago
- Visualise Wikipedia page edits using History Flow☆49Updated 8 years ago