raleighpublicrecord / dochiveLinks
Structured Data from PDF image-based files
☆88Updated 12 years ago
Alternatives and similar repositories for dochive
Users that are interested in dochive are comparing it to the libraries listed below
Sorting:
- This a module to extract RDF from an HTML5 page annotated with microdata. The module implements the algorithm defined and published by th…☆44Updated 3 years ago
- A place to collect and share knowledge about liberating data from PDFs☆54Updated 3 years ago
- Discover, analyze and present data from the web and mobile in meaninful ways☆82Updated 11 years ago
- Tools for working with Optical Character Recognition output☆16Updated 11 years ago
- Docker container to provide Apache Tika RESTful API☆41Updated 9 years ago
- **el.vis** - a tool for visualising public (EU) tenders big data☆8Updated 2 years ago
- The OpenSextant Gazetteer is a collection of world-wide place name data☆12Updated 7 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 8 years ago
- LoadKit supports Extract, Transform, Load processes based on ArchiveKit buckets.☆11Updated 10 years ago
- Provide RESTful access to SKOS vocabularies☆58Updated 2 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 8 years ago
- Data Pipes for CSV☆116Updated 2 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- Lacuna: Digital Annotation for Teaching and Learning☆37Updated 6 years ago
- Create and validate Data Packages in the browser☆27Updated 3 years ago
- Tools for generating portable data portals☆58Updated 2 years ago
- Ideas for (tech) stuff to research, build or work on.☆50Updated 5 months ago
- Python library and command line tool for converting data from one format to another☆99Updated 5 years ago
- The Wikiba.se website☆22Updated 7 years ago
- Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.☆84Updated 9 years ago
- A platform for tools that do stuff with data☆56Updated 6 years ago
- Easily crowdsource the analysis of your documents☆102Updated 7 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs☆17Updated 7 years ago
- An online annotation platform for teaching and learning in the humanities.☆108Updated this week
- Linked Open Vocabularies (LOV) - scripts☆9Updated 8 years ago
- Using social media to steer web archiving and curation.☆15Updated 9 years ago
- paginate-for-print is trying to recreate some of the basic features of pagination.js without using CSS Regions with a focus on Chrome, Fi…☆25Updated 5 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- Data Store for Annotation Studio☆46Updated 2 years ago
- View, visualize, clean and process data in the browser.☆147Updated 7 years ago