danielgatis / docscan
Docscan is a document scanner. Take a photo of your documents and frame it.
☆95Updated last week
Related projects ⓘ
Alternatives and complementary repositories for docscan
- Tutorial on how to deskew (straighten) text images☆50Updated 2 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆77Updated last year
- An application of high resolution GANs to dewarp images of perturbed documents☆125Updated 3 years ago
- Library used to deskew a scanned document☆418Updated last month
- DFKI Layout Detection for OCR-D☆47Updated 2 weeks ago
- Real-time detection of documents in images☆75Updated 2 months ago
- Document Image Binarization☆73Updated last month
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 3 months ago
- ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for us…☆51Updated this week
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation☆116Updated 11 months ago
- Detect textlines in document images☆90Updated 5 months ago
- Detect and read handwritten words on scanned pages.☆106Updated last year
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆127Updated last week
- A function that takes as input a cropped text line image, and outputs the dewarped image.☆16Updated 2 weeks ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆31Updated 2 years ago
- This repository contains an implementation of the "Representation Learning for Information Extraction from Form-like Documents" paper.☆25Updated 3 years ago
- BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on sc…☆105Updated last year
- Tools for extract figure, table, text, .. from a pdf document.☆32Updated 3 years ago
- A line-based framework to detect and extract tabular data in JSON format from raster images using computer vision and Tesseract OCR.☆50Updated 11 months ago
- A Gated and Bifurcated Stacked U-Net Module for Document Image Dewarping☆101Updated 2 years ago
- OCR-D-compliant page segmentation☆67Updated 2 months ago
- Deep learning based page layout analysis☆195Updated 5 years ago
- Integrate AI-powered Document Analysis Pipelines☆62Updated this week
- python ocr using tesseract/ with EAST opencv detector☆42Updated 4 months ago
- Perspective recovery of text using transformed ellipses☆148Updated 3 years ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆181Updated 2 years ago
- RUN LENGTH SMOOTHING ALGORITHM(RLSA) is a method mainly used for block segmentation and text discrimination. It helps to extract the nece…☆28Updated last year
- Segmentation of ID Cards using Semantic Segmentation☆101Updated 4 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 3 years ago
- The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.☆76Updated 4 months ago