fritz-hh / OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆260Updated 9 years ago
Alternatives and similar repositories for OCRmyPDF:
Users who are interested in OCRmyPDF are comparing it to the libraries listed below
- A toolbox and web application for working with and presenting textual material from Shakespeare to Schopenhauer, and letters to literatur…☆149Updated 9 years ago
- Facilitating the global conversation on academic literature☆264Updated 7 years ago
- A JavaScript tool to visualize the diffs in Wikipedia☆35Updated 2 years ago
- Data Pipes for CSV☆117Updated 2 years ago
- Docker container to provide Apache Tika RESTful API☆40Updated 9 years ago
- An API implementing a grammar for text analysis☆13Updated 9 years ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io☆38Updated 9 years ago
- neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität …☆68Updated 6 years ago
- An online annotation platform for teaching and learning in the humanities.☆107Updated this week
- A Node.js library for monitoring changes on Wikipedia sites☆70Updated 3 years ago
- “Let Me Get That Data For You” catalogs the machine-readable data on a given domain name. [RETIRED]☆102Updated 9 years ago
- A push-button Digital Humanities laboratory.☆126Updated 6 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with Twitter.☆86Updated last year
- A platform for collaborative social media verification☆55Updated 7 years ago
- scribe API☆81Updated 5 years ago
- A tool for the geospatial analysis, literary network visualization, and plot mapping of ancient texts☆14Updated 6 years ago
- Automatic alignment of books between HathiTrust, Internet Archive, Google Books, etc.☆36Updated 10 months ago
- Original 2016 take at what is now Linked Paths, the demonstrator for GeoJSON-T developed under a Pelagios micro-grant☆90Updated 7 years ago
- Lens - open science content creation and display☆124Updated 8 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago
- Enhanced Social Tagging for Academic Communities☆94Updated 4 months ago
- Easily crowdsource the analysis of your documents☆102Updated 7 years ago
- Envisioning the future of the Hypothesis.☆40Updated 6 years ago
- A visual implementation of individual U.S. taxes☆38Updated 9 months ago
- Monitor datasets, get alerts when something happens☆210Updated 6 years ago
- Modular workflow assistant for book digitization☆126Updated 8 years ago
- Display URLs being tweeted with an event hashtag☆18Updated 8 years ago
- A place to collect and share knowledge about liberating data from PDFs☆54Updated 3 years ago