fritz-hh / OCRmyPDFLinks
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆261Updated 9 years ago
Alternatives and similar repositories for OCRmyPDF
Users that are interested in OCRmyPDF are comparing it to the libraries listed below
Sorting:
- A toolbox and web application for working with and presenting textual material from Shakespeare to Schopenhauer, and letters to literatur…☆149Updated 10 years ago
- neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität …☆68Updated 6 years ago
- Modular workflow assistant for book digitization☆126Updated 9 years ago
- Docker container to provide Apache Tika RESTful API☆41Updated 9 years ago
- “Let Me Get That Data For You” catalogs the machine-readable data on a given domain name. [RETIRED]☆102Updated 10 years ago
- [DEPRECATED] Please use https://datahub.io/docs/features/data-cli☆109Updated 7 years ago
- Breve☆29Updated 5 years ago
- MOVED TO https://gitlab.com/crossref/pdfmark☆33Updated 6 years ago
- A javascript tool to visualize the diff's in wikipedia☆35Updated 2 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago
- ☆31Updated 2 years ago
- A collection of design patterns and best practice for the civic community.☆81Updated 7 years ago
- A fast, responsive HTML5 viewer for scanned items, developed for the World Digital Library. A project of the Library of Congress. Note: p…☆22Updated 10 years ago
- Website for the LaTeX Boilerplates☆21Updated 8 years ago
- Original 2016 take at what is now Linked Paths, the demonstrator for GeoJSON-T developed under a Pelagios micro-grant☆89Updated 8 years ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io☆38Updated 9 years ago
- Test cases for validating BagIt implementations☆11Updated 2 years ago
- Scan a folder of document files of all types and extract the text into a CSV suitable for Overview☆26Updated 9 years ago
- DEPRECATED. This repository is no longer maintained. Please fork and work away.☆122Updated 10 years ago
- ☆17Updated 10 years ago
- just merge all the pdfs in a directory in abc order☆49Updated 6 years ago
- Code for Newslynx App☆22Updated 9 years ago
- A Zotero plugin for mapping collections.☆35Updated 9 years ago
- Moved to:☆58Updated 5 years ago
- List of decentralized tools☆21Updated 9 years ago
- Pathways Project☆14Updated 9 years ago
- [DEPRECATED] Please use https://goodtables.io☆13Updated 8 years ago
- Adds text to PDF files using the cuneiform OCR software☆326Updated 4 years ago
- Detective.io is a platform that hosts your investigation and lets you make powerful queries to mine it. Simply describe your field of stu…☆137Updated 9 years ago
- (Note: This repository is obsolete, please see the new Browsertrix webrecorder/browsertrix) Browser-Based On-Demand Web Archiving Automat…☆39Updated 6 years ago