monniert / docExtractor
(ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper
☆88Updated last year
Alternatives and similar repositories for docExtractor:
Users that are interested in docExtractor are comparing it to the libraries listed below
- The Learnable Typewriter: A Generative Approach to Text Line Analysis☆31Updated 5 months ago
- dhSegment on pytorch☆34Updated last year
- ☆81Updated last year
- ☆10Updated 2 years ago
- ☆10Updated 2 years ago
- ☆9Updated 4 years ago
- A suite of batches and tools for OCR tasks.☆71Updated last year
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated last year
- PAGE XML format collection for document image page content and more☆67Updated 3 years ago
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆10Updated 2 years ago
- Ground Truth Resources for the HTR of patrimonial documents☆42Updated last week
- Code and data for the paper at http://arxiv.org/abs/2004.07317☆16Updated 4 years ago
- Hadwritten Text Recognition in Few-shot Scenario☆20Updated 2 years ago
- OCR-D python tools☆33Updated 7 months ago
- Toolbox for OCR post-correction☆121Updated 5 years ago
- DFKI Layout Detection for OCR-D☆47Updated this week
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆13Updated 8 months ago
- OCR & Ground Truth Resources☆75Updated 2 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated last year
- ☆58Updated last week
- Repository of the back end implementation of DivaServices☆14Updated 5 years ago
- Master repository which includes most other OCR-D repositories as submodules☆72Updated last week
- convert PubLayNet data into METS/PAGE-XML☆10Updated 5 years ago
- An OCR evaluation tool☆65Updated last month
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- OCR-D-compliant page segmentation☆67Updated 3 weeks ago
- Generic framework for historical document processing☆375Updated 3 years ago
- ☆24Updated last year
- Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription.☆37Updated 2 years ago
- Repository for the deep-learning framework DIVA-DAF which is build with historical document image analysis in mind.☆18Updated 4 months ago