raleighpublicrecord / dochive
Structured Data from PDF image-based files
☆88 Updated 12 years ago
Alternatives and similar repositories for dochive
Users who are interested in dochive are comparing it to the libraries listed below.
- Docker container to provide Apache Tika RESTful API ☆41 Updated 9 years ago
- A place to collect and share knowledge about liberating data from PDFs ☆54 Updated 3 years ago
- Discover, analyze and present data from the web and mobile in meaningful ways ☆82 Updated 12 years ago
- View, visualize, clean and process data in the browser. ☆147 Updated 7 years ago
- A fast, responsive HTML5 viewer for scanned items, developed for the World Digital Library. A project of the Library of Congress. Note: p… ☆22 Updated 10 years ago
- ☆29 Updated 8 years ago
- Data Pipes for CSV ☆116 Updated 2 years ago
- An online annotation platform for teaching and learning in the humanities. ☆108 Updated last week
- Shave pages off of PDFs as images ☆59 Updated 7 years ago
- Detective.io is a platform that hosts your investigation and lets you make powerful queries to mine it. Simply describe your field of stu… ☆136 Updated 10 years ago
- Create and validate Data Packages in the browser ☆27 Updated 3 years ago
- Parser for U.S. federal regulations and other regulatory information ☆40 Updated 2 years ago
- Open Data Index website ☆41 Updated 7 years ago
- Lacuna: Digital Annotation for Teaching and Learning ☆37 Updated 6 years ago
- A Relaxed Schema Graph Database Management System ☆53 Updated 5 years ago
- Ideas for (tech) stuff to research, build or work on. ☆50 Updated 7 months ago
- Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard. ☆84 Updated 9 years ago
- Transform any dataset into an HTTP API with The DataTank ☆82 Updated 5 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news" ☆61 Updated 4 years ago
- Segrada - Semantic Graph Database ☆70 Updated 4 months ago
- Named-Entity Recognition extension for Google Refine / OpenRefine ☆73 Updated 8 years ago
- A Python web application for converting PDF forms into PDF-filling APIs ☆48 Updated 4 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs ☆17 Updated 7 years ago
- This is the repo for the Data on the Web Best Practices Working Group. ☆52 Updated 4 years ago
- A library for extracting tables from PDF files ☆89 Updated 11 years ago
- A simple PDF transcription project for PyBossa ☆19 Updated 9 years ago
- Schemas and helpful handlers for OADA-related formats. ☆16 Updated 5 years ago
- A toolbox and web application for working with and presenting textual material from Shakespeare to Schopenhauer, and letters to literatur… ☆149 Updated 10 years ago
- A queue-controlled browser automation tool for improving web crawl quality ☆61 Updated this week
- Slaw is a lightweight library for rendering and generating Akoma Ntoso acts from plain text and PDF documents. ☆27 Updated 3 years ago