dinosauria123 / makepdf
Make a searchable pdf via Google Cloud Vision OCR
☆14Updated 5 years ago
Alternatives and similar repositories for makepdf
Users who are interested in makepdf are comparing it to the repositories listed below.
- Tool to OCR PDFs using Google Cloud Vision☆42Updated 2 years ago
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆106Updated 4 years ago
- A python utility to convert .srt files to a .txt format.☆14Updated 2 years ago
- OCRmyPDF EasyOCR plugin☆84Updated last month
- Repository for the NDL古典籍OCR-Lite application, an OCR tool for Japanese classical texts (includes source code)☆100Updated 2 months ago
- Convert ALTO XML to plain text + minimal metadata☆16Updated 7 months ago
- The hOCR Embedded OCR Workflow and Output Format☆74Updated 9 months ago
- guides and test data for OCR4all☆30Updated 2 years ago
- Deep Zoom Image Downloader☆20Updated 3 weeks ago
- OCR system for recognizing modern Japanese magazines☆146Updated last year
- Ergonomic line-by-line transcription of scanned text.☆51Updated 4 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated last year
- Given the URL to a public JSON document in an International Image Interoperability Framework (IIIF) image server, this script will downlo…☆16Updated 2 years ago
- Automatic de-keystoning for single camera DIY book scanners.☆49Updated 4 years ago
- Command-line tile downloader/assembler for IIIF endpoints/manifests☆34Updated 3 years ago
- Explains, in Japanese, how to conform to the TEI Guidelines.☆12Updated 4 years ago
- Tools to process books in a cloud based pipeline system☆61Updated last month
- Process, enhance and evaluate multiple OCR output.☆22Updated 6 months ago
- Conversions between various OCR formats☆77Updated 2 years ago
- The CIS OCR PostCorrectionTool☆42Updated 2 years ago
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆176Updated 2 weeks ago
- OCR with Google's AI technology (Cloud Vision API)☆74Updated 2 years ago
- 🌸De-inflect Japanese words☆12Updated last month
- The NDL古典籍OCR application, an OCR tool for Japanese classical texts (includes source code)☆66Updated this week
- Repository hosting the common code for the entity-fishing clients☆10Updated 11 months ago
- OCR-D python tools☆33Updated 9 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆40Updated 2 weeks ago
- Top 5000 Japanese family names, with readings, ordered by frequency.☆16Updated 7 years ago
- A context-based spellchecker for correcting OCR output.☆19Updated 2 years ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆65Updated last year