Layout-Parser / platform
☆10Updated 3 years ago
Alternatives and similar repositories for platform:
Users that are interested in platform are comparing it to the libraries listed below
- An OCR evaluation tool☆65Updated last month
- DFKI Layout Detection for OCR-D☆47Updated this week
- OCR & Ground Truth Resources☆74Updated 2 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated last year
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated last year
- Glyph Miner, a system for extracting glyphs from early typeset prints☆34Updated 8 years ago
- In-browser OCR of Ancient Greek and Latin☆26Updated last week
- Master repository which includes most other OCR-D repositories as submodules☆72Updated this week
- link raw affiliation to ROR ids☆29Updated last year
- Conversions between various OCR formats☆74Updated last year
- ☆12Updated 11 months ago
- ☆67Updated last year
- Tools for evaluating OCR performance relative to ground truth.☆10Updated last year
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆175Updated 2 years ago
- Named entity annotation tool☆27Updated last year
- Small python package to measure OCR quality and other related metrics.☆21Updated last year
- Document Image Binarization☆77Updated 5 months ago
- Two-Step Approach to OCR Post-Correction☆14Updated 10 months ago
- convert PubLayNet data into METS/PAGE-XML☆10Updated 5 years ago
- PAGE XML format collection for document image page content and more☆67Updated 3 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆15Updated 7 months ago
- OCR-D wrapper for detectron2 based segmentation models☆16Updated 5 months ago
- Ground Truth Resources for the HTR of patrimonial documents☆41Updated this week
- Python tools for performing various operations on ALTO XML files☆45Updated last month
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆35Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆104Updated 7 months ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆36Updated last year
- Libraries, Archives and Museums (LAM)☆82Updated 2 years ago
- A suite of batches and tools for OCR tasks.☆71Updated last year
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆55Updated 8 months ago