dinosauria123 / makepdf
Make a searchable pdf via Google Cloud Vision OCR
☆14Updated 5 years ago
Alternatives and similar repositories for makepdf:
Users that are interested in makepdf are comparing it to the libraries listed below
- Tool to OCR PDFs using Google Cloud Vision☆39Updated 2 years ago
- Convert ALTO XML to plain text + minimal metadata☆13Updated 3 months ago
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆104Updated 4 years ago
- Docker integration of Kitodo.Production and OCR-D☆9Updated 10 months ago
- Given the URL to a public JSON document in an International Image Interoperability Framework (IIIF) image server, this script will downlo…☆16Updated 2 years ago
- Command-line tile downloader/assembler for IIIF endpoints/manifests☆33Updated 3 years ago
- Ergonomic line-by-line transcription of scanned text.☆50Updated 4 years ago
- Docker setup for OCR4all bundled with Larex☆21Updated last year
- ☆10Updated 3 years ago
- guides and test data for OCR4all☆30Updated 2 years ago
- ☆24Updated 2 weeks ago
- Automatic de-keystoning for single camera DIY book scanners.☆49Updated 4 years ago
- Locolligo is a single-page, browser-based javascript application to facilitate the formatting, linking, and geolocation of datasets, with…☆14Updated 11 months ago
- DOCX to JATS XML Converter☆20Updated 2 years ago
- Example repo of using esmodules with Apps Script☆30Updated 3 years ago
- Efficient hOCR tooling☆42Updated 4 months ago
- The CIS OCR PostCorrectionTool☆41Updated 2 years ago
- Master repository which includes most other OCR-D repositories as submodules☆72Updated this week
- Conversions between various OCR formats☆73Updated last year
- A standalone React/Redux web application for for presenting unique printed books and manuscripts in digital facsimile.☆32Updated last year
- A repository to organize materials from the AI4LAM Teach and Learning Working Group☆14Updated last year
- The Reference Stylesheets developed and released by EpiDoc for use with XML documents following the EpiDoc schema.☆16Updated this week
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆138Updated this week
- Python script for exporting metadata from Omeka to a CSV file via the Omeka API.☆17Updated last month
- Process, enhance and evaluate multiple OCR output.☆22Updated 3 months ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated last year
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... Fast!!☆32Updated 2 weeks ago
- Umbrella repository that describes the collections contained in any given release of ELTeC☆13Updated 3 years ago
- IIIF experiments with Gallica content☆25Updated 3 months ago
- 🌸De-inflect Japanese words☆12Updated 2 years ago