ocrmypdf / OCRmyPDF-EasyOCR
OCRmyPDF EasyOCR plugin
☆84 Updated last month
Alternatives and similar repositories for OCRmyPDF-EasyOCR
Users who are interested in OCRmyPDF-EasyOCR are comparing it to the libraries listed below:
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents. ☆176 Updated 2 weeks ago
- Document image dewarping library using a cubic sheet model ☆155 Updated this week
- A post-processing tool for scanned sheets of paper. ☆81 Updated last year
- Building scantailor and its dependencies ☆58 Updated last year
- Toolkit for training/converting LibreTranslate compatible language models 🚂 ☆52 Updated last month
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR ☆109 Updated this week
- This project aims to extract text from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging t… ☆33 Updated 3 months ago
- Web-based editor for subtitles and transcripts ☆130 Updated 9 months ago
- Logical structure analysis for visually structured documents ☆89 Updated 2 years ago
- PDF Table Extractor is an innovative Python project designed to tackle the challenge of extracting tables from scanned PDF documents. Lev… ☆29 Updated last year
- Library used to deskew a scanned document ☆461 Updated 2 weeks ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation ☆136 Updated this week
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆340 Updated 2 years ago
- ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones … ☆232 Updated 9 months ago
- Document Image Binarization ☆78 Updated 7 months ago
- This repository contains code for line detection, character detection and recognition on the cuneiform 2d images ☆32 Updated 5 years ago
- Document Layout Analysis ☆373 Updated this week
- Tutorial on how to deskew (straighten) text images ☆51 Updated 3 years ago
- Apply different text recognition services to images of handwritten documents. ☆178 Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options ☆149 Updated last year
- Docx tracked change redlines for the Python ecosystem. ☆62 Updated 10 months ago
- This Repository contains a Jupyter notebook explaining how to detect checkboxes/table cells from a scanned image ☆31 Updated 4 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip… ☆69 Updated last month
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis ☆106 Updated 2 months ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte… ☆202 Updated 4 months ago
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST ☆212 Updated last month
- This master's thesis project is based on OpenAI Whisper, with the goal of transcribing interviews ☆47 Updated 9 months ago
- Python library to extract tabular data from images and scanned PDFs ☆278 Updated 9 months ago
- Training scripts for Argos Translate ☆131 Updated 6 months ago
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models ☆152 Updated 7 months ago