polm / ndl-crop
Script for cropping photos from the NDL.
☆38 Updated 2 years ago
Alternatives and similar repositories for ndl-crop:
Users who are interested in ndl-crop are comparing it to the repositories listed below:
- Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree. ☆158 Updated 2 years ago
- Working with hOCR in JavaScript ☆124 Updated last year
- Smart crops images using OpenCV ☆39 Updated 6 years ago
- DFKI Layout Detection for OCR-D ☆47 Updated 3 months ago
- Python OCR using Tesseract with the EAST OpenCV detector ☆42 Updated 7 months ago
- Tutorial on how to deskew (straighten) text images ☆51 Updated 2 years ago
- Web based JavaScript GUI library for proofreading/editing hOCR ☆93 Updated 6 years ago
- Scripts and results from our OCR roundup, available on Source ☆150 Updated 6 years ago
- OCR-D-compliant page segmentation ☆67 Updated last week
- Image to dense vector embedding. Clone of https://github.com/christiansafka/img2vec for Keras users ☆37 Updated 5 years ago
- The hOCR Embedded OCR Workflow and Output Format ☆74 Updated 6 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆184 Updated 2 months ago
- Implementation of perceptual image hash calculation in Python ☆131 Updated last year
- Wrapper around pixel classifier ☆9 Updated 2 years ago
- Framework to build your own reverse image search engine ☆80 Updated 4 years ago
- Apply different text recognition services to images of handwritten documents. ☆174 Updated 2 years ago
- Python package for Stroke Width Transform - Localizing the Text (Letters & Words) in a Natural Image ☆37 Updated last year
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆186 Updated 2 weeks ago
- Augment line images for improving OCR datasets ☆9 Updated last year
- POC for similarity search by abstract features ☆41 Updated 2 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆382 Updated 6 months ago
- Detect and fix skew in images containing text ☆263 Updated 5 years ago
- Deep learning based page layout analysis ☆195 Updated 5 years ago
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel… ☆23 Updated 4 years ago
- Repository collecting all the submodules for the new PyTorch-based OCR System. ☆141 Updated 3 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and hOCR. ☆35 Updated last year
- Image Annotation Tool and Image Search ☆14 Updated 2 weeks ago
- A collection of tools for cleaning up book scans. ☆138 Updated 2 years ago
- hOCR Specification Python Parser ☆13 Updated 9 years ago
- A basic duplicate image detection service using perceptual image hash functions and nearest neighbor search, implemented using faiss, fas… ☆31 Updated 3 years ago