mzucker / page_dewarp
Text page dewarping using a "cubic sheet" model
☆1,453Updated last year
Alternatives and similar repositories for page_dewarp:
Users that are interested in page_dewarp are comparing it to the libraries listed below
- An OpenCV based document scanner☆801Updated 8 years ago
- Perspective recovery of text using transformed ellipses☆148Updated 3 years ago
- A post-processing tool for scanned sheets of paper.☆1,055Updated 6 months ago
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.☆1,273Updated 4 years ago
- Convert scans of handwritten notes to beautiful, compact PDFs☆4,815Updated 9 months ago
- A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.☆2,230Updated 2 years ago
- Document Image Dewarping☆347Updated 5 years ago
- ☆1,675Updated 4 years ago
- A collection of tools for cleaning up book scans.☆137Updated 2 years ago
- Detect and fix skew in images containing text☆261Updated 5 years ago
- Line based ATR Engine based on OCRopy☆1,065Updated 2 months ago
- Python-based tools for document analysis and OCR☆3,432Updated 3 years ago
- A small C++ implementation of LSTM networks, focused on OCR.☆823Updated 5 years ago
- Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)☆512Updated 2 months ago
- Handwritten math expression parser☆681Updated 4 years ago
- OCR engine for all the languages☆767Updated this week
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆375Updated 5 months ago
- Library used to deskew a scanned document☆434Updated last week
- Document image dewarping library using a cubic sheet model☆129Updated this week
- ImagePlay is a rapid prototyping application for image processing☆1,169Updated last year
- A Python wrapper for the tesseract-ocr API☆2,042Updated last month
- Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf fi…☆1,582Updated last year
- A simple python OCR engine using opencv☆527Updated 11 months ago
- An application of high resolution GANs to dewarp images of perturbed documents☆131Updated 3 years ago
- Document Layout Analysis☆359Updated 3 weeks ago
- Links to awesome OCR projects☆2,866Updated 6 months ago
- Run your own OCR-as-a-Service using Tesseract and Docker☆1,348Updated last year
- Repository collecting all the submodules for the new PyTorch-based OCR System.☆141Updated 3 years ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.☆511Updated 3 years ago
- A selectional auto-encoder approach for document image binarization☆102Updated 2 years ago