eloops / hocr2pdfLinks
take scanned image, and hocr output from tesseract, create PDF. Thats it.
☆25Updated 2 years ago
Alternatives and similar repositories for hocr2pdf
Users that are interested in hocr2pdf are comparing it to the libraries listed below
Sorting:
- Simple Rust file server which lets you upload, share, and download files from a web browser. Ready-to-run binaries for Windows, Mac, and …☆58Updated 3 years ago
- PortableSigner - A Commandline and GUI Tool to digital sign PDF files with X.509 certificates☆122Updated 6 years ago
- Live SQLite3 database master-slave replication with sqlite3-rdiff using rsync over SSH☆40Updated 8 years ago
- Tools to process books in a cloud based pipeline system☆62Updated 5 months ago
- Web based JavaScript GUI library for proofreading/editing hOCR☆97Updated 7 years ago
- Clone of https://gitlab.com/scripta/escriptorium.git☆28Updated 2 months ago
- Multilingual handwriting recognition engine for iOS, Android, Windows, Linux, MAC OS X...☆75Updated 3 years ago
- User contributed (non Google) OCR models for Tesseract☆29Updated 5 months ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated 6 months ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆125Updated 3 weeks ago
- Container sandbox for GUI applications☆28Updated 2 years ago
- Working with hOCR in Javascript☆135Updated 2 years ago
- Batch processing helper – GUI – for “ScanTailor-CLI” -- created by Csaba Kovacs☆15Updated 9 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆397Updated last year
- A post-processing tool for scanned sheets of paper.☆1,118Updated last year
- Automatic de-keystoning for single camera DIY book scanners.☆49Updated 5 years ago
- Minstrel is a FLOSS hybrid reading app specifically designed for Audio-eBooks☆97Updated 8 years ago
- Textricator is a tool to extract text from documents and generate structured data.☆350Updated 6 months ago
- PDF to XML ALTO file converter☆254Updated 3 weeks ago
- All Apertium language pairs, modules, tools and core☆70Updated 4 years ago
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆24Updated 10 years ago
- Communote server with core plugins☆21Updated 6 years ago
- resource scheduling and event planing☆64Updated 2 weeks ago
- Crop And Splice Segments (of scanned pages)☆14Updated 6 years ago
- Test SMTP/IMAP server for local integration testing☆13Updated 2 years ago
- Older code for Bibledit☆10Updated 8 years ago
- Ergonomic line-by-line transcription of scanned text.☆53Updated 4 years ago
- Java Optical CHaracter Recognition☆22Updated last year
- An Enterprise Calendar and Scheduling System☆44Updated this week
- A post-processing tool for scanned sheets of paper.☆82Updated last year