fritz-hh / OCRmyPDFLinks
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆261Updated 9 years ago
Alternatives and similar repositories for OCRmyPDF
Users that are interested in OCRmyPDF are comparing it to the libraries listed below
Sorting:
- A toolbox and web application for working with and presenting textual material from Shakespeare to Schopenhauer, and letters to literatur…☆149Updated 10 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 8 years ago
- Enhanced Social Tagging for Academic Communities☆99Updated last year
- mirror a website, put it in a bag☆24Updated 2 years ago
- ☆29Updated 8 years ago
- Modular workflow assistant for book digitization☆131Updated 9 years ago
- command line resource for working with digital primary sources☆28Updated 7 years ago
- (Note: This repository is obsolete, please see the new Browsertrix webrecorder/browsertrix) Browser-Based On-Demand Web Archiving Automat…☆39Updated 6 years ago
- A simple OpenRefine reconciliation service that runs on top of a CSV file☆124Updated 10 years ago
- An online annotation platform for teaching and learning in the humanities.☆108Updated 2 months ago
- Original 2016 take at what is now Linked Paths, the demonstrator for GeoJSON-T developed under a Pelagios micro-grant☆90Updated 8 years ago
- display urls being tweeted with an event hashtag☆18Updated 9 years ago
- Docker container to provide Apache Tika RESTful API☆41Updated 9 years ago
- Extract tables from PDF files☆359Updated 9 years ago
- A push-button Digital Humanities laboratory.☆127Updated 7 years ago
- scribe API☆81Updated 6 years ago
- Politwoops web front end☆44Updated 8 years ago
- A backend store for the Annotator☆180Updated 9 years ago
- a NodeJS library for monitoring changes on Wikipedia sites☆70Updated 4 years ago
- A fast, responsive HTML5 viewer for scanned items, developed for the World Digital Library. A project of the Library of Congress. Note: p…☆22Updated 10 years ago
- A place to collect and share knowledge about liberating data from PDFs☆55Updated 3 years ago
- a CLI suggestion tool for Wikidata entities☆30Updated 9 years ago
- The jQuery virtual stack plugin☆55Updated 7 years ago
- Drop in crowdsourcing for your Rails app. Extracted from Free the Files.☆83Updated 10 years ago
- Scan a folder of document files of all types and extract the text into a CSV suitable for Overview☆26Updated 9 years ago
- Social Feed Manager user interface application.☆156Updated last year
- Lens - open science content creation and display☆124Updated 9 years ago
- FromThePage is a wiki-like application for crowdsourcing transcription of handwritten documents.☆179Updated this week
- “Let Me Get That Data For You” catalogs the machine-readable data on a given domain name. [RETIRED]☆102Updated 10 years ago
- Guides and introductions for participating in Labs and some of its projects.☆170Updated 9 years ago