ocrmypdf / OCRmyPDF-EasyOCRLinks
OCRmyPDF EasyOCR plugin
☆85Updated 2 months ago
Alternatives and similar repositories for OCRmyPDF-EasyOCR
Users that are interested in OCRmyPDF-EasyOCR are comparing it to the libraries listed below
Sorting:
- Building scantailor and its dependencies☆58Updated last year
- A post-processing tool for scanned sheets of paper.☆82Updated last year
- Document image dewarping library using a cubic sheet model☆158Updated this week
- Toolkit for training/converting LibreTranslate compatible language models 🚂☆52Updated 2 months ago
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST☆213Updated 2 months ago
- Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provi…☆18Updated last month
- web based editor for subtitles and transcripts☆133Updated 9 months ago
- Scan Tailor Experimental is an interactive post-processing tool for scanned pages.☆69Updated 2 weeks ago
- A collection of tools for cleaning up book scans.☆142Updated 2 years ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆205Updated 5 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆186Updated last week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆27Updated 2 years ago
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆106Updated 4 years ago
- A Python library to extract tabular data from PDFs☆65Updated 2 months ago
- Docker Image with latest Tesseract OCR Version 5.x.x built from sources☆38Updated this week
- A curated list of resources around PDF files☆133Updated 10 months ago
- ez audio transcription tool with flexible processing and post-processing options☆151Updated last year
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR☆114Updated this week
- This project aims to extract text from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging t…☆33Updated 4 months ago
- These are widgets for use in Grist. Feel free to use, copy, fork as you wish (as long as you keep it all open source).☆11Updated 6 months ago
- convert ZIM files, as found in the Kiwix WikiPedia library, to a SQLite database that is read by the WikiReader plugin of KOReader☆18Updated last year
- Docx tracked change redlines for the Python ecosystem.☆62Updated 11 months ago
- ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones …☆237Updated 9 months ago
- A Python asyncio wrapper for Tesseract-OCR.☆26Updated 7 months ago
- Tutorial on how to deskew (straighten) text images☆51Updated 3 years ago
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆180Updated last week
- Document Layout Analysis☆376Updated 3 weeks ago
- This repository contains code for line detection, character detection and recognition on the cuneiform 2d images☆32Updated 5 years ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆126Updated last month
- Repository for deepdoctection tutorial notebooks☆45Updated 6 months ago