IgorMeloS / OCRLinks
Image pre-processing and OCR techniques with OpenCV and PyTesseract
☆24Updated 3 years ago
Alternatives and similar repositories for OCR
Users that are interested in OCR are comparing it to the libraries listed below
Sorting:
- DocLLM: A layout-aware generative language model for multimodal document understanding☆129Updated last year
- Object Detection Model for Scanned Documents☆94Updated 8 months ago
- UniTable: Towards a Unified Table Foundation Model☆512Updated last year
- Repository for deepdoctection tutorial notebooks☆45Updated 4 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆148Updated 5 months ago
- This PyTorch implementation of LayoutLM paper by Microsoft demonstrate the SequenceClassfication task using HuggingFaceTransformers to cl…☆34Updated 3 years ago
- Handwritten text detection in document images using Detectron2☆21Updated 3 years ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆395Updated 2 years ago
- ☆197Updated this week
- ☆162Updated last week
- Pipeline for converting PDFs to raw text with PaddleOCR☆23Updated 2 years ago
- Excel spreadsheet crawler and table parser for data extraction and querying☆161Updated 8 months ago
- A Python library to chunk/group your texts based on semantic similarity.☆98Updated last year
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆74Updated this week
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR☆158Updated last week
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆202Updated 8 months ago
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis☆139Updated 3 months ago
- ☆22Updated last year
- Checkbox Detection Model for Scanned Documents☆89Updated 8 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆65Updated last year
- DocILE: Document Information Localization and Extraction Benchmark☆137Updated last year
- YOLOv10 trained on DocLayNet dataset.☆78Updated last year
- Document image dewarping library using a cubic sheet model