vinodbaste / paddleOCR_rec_dec
Optical Character Recognition (OCR) is a powerful technology that enables machines to recognize and extract text from images or scanned documents. OCR finds applications in various fields, including document digitization, text extraction from images, and text-based data analysis.
☆19 Updated 2 years ago
Alternatives and similar repositories for paddleOCR_rec_dec
Users who are interested in paddleOCR_rec_dec are comparing it to the libraries listed below.
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation ☆146 Updated 5 months ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte… ☆220 Updated 9 months ago
- Object Detection Model for Scanned Documents ☆94 Updated 7 months ago
- Proceed with text detection only in the selected area of the image ☆243 Updated last year
- This Repository consists of all my experiments performed on LayoutLMv3 model. ☆33 Updated 3 years ago
- ☆385 Updated last year
- Checkbox Detection Model for Scanned Documents ☆89 Updated 7 months ago
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks. ☆133 Updated this week
- Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector ☆267 Updated 3 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆215 Updated 2 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating. ☆202 Updated 7 months ago
- Document Layout Analysis ☆390 Updated last week
- Tutorial on how to deskew (straighten) text images ☆52 Updated 3 years ago
- YOLOv10 trained on DocLayNet dataset. ☆77 Updated 11 months ago
- Pytorch Implementation of TableNet ☆67 Updated 4 years ago
- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition ☆282 Updated 3 years ago
- TableNet Implementation on Pytorch ☆148 Updated 2 years ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆393 Updated 2 years ago
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared wi… ☆51 Updated last year
- A collection of OCR-related datasets ☆191 Updated 3 years ago
- Python library to extract tabular data from images and scanned PDFs ☆283 Updated last year
- DocILE: Document Information Localization and Extraction Benchmark ☆136 Updated last year
- Library used to deskew a scanned document ☆488 Updated 3 weeks ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan… ☆357 Updated 2 years ago
- NanoNets OCR API Example for Python ☆203 Updated 3 years ago
- Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes ☆465 Updated 3 months ago
- Detect textlines in document images ☆92 Updated last year
- Key information extraction from invoice document with Graph Convolution Network ☆55 Updated 2 years ago
- https://dl.acm.org/doi/10.1145/3657281 ☆97 Updated last year
- CORD: A Consolidated Receipt Dataset for Post-OCR Parsing ☆446 Updated 3 years ago