monniert / docExtractor
(ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper
☆86 · Updated last year
Related projects
Alternatives and complementary repositories for docExtractor
- The Learnable Typewriter: A Generative Approach to Text Line Analysis · ☆28 · Updated 3 weeks ago
- dhSegment on pytorch · ☆32 · Updated last year
- ☆77 · Updated last year
- Code and data for the paper at http://arxiv.org/abs/2004.07317 · ☆16 · Updated 4 years ago
- ☆10 · Updated last year
- OCR-D python tools · ☆33 · Updated 3 months ago
- A suite of batches and tools for OCR tasks. · ☆71 · Updated last year
- PAGE XML format collection for document image page content and more · ☆66 · Updated 3 years ago
- Handwritten Text Recognition in Few-shot Scenario · ☆20 · Updated last year
- OCR & Ground Truth Resources · ☆74 · Updated 2 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL) · ☆52 · Updated last year
- Ground Truth Resources for the HTR of patrimonial documents · ☆39 · Updated this week
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. · ☆180 · Updated last week
- ☆10 · Updated last year
- DFKI Layout Detection for OCR-D · ☆47 · Updated 2 weeks ago
- Create handwritten word embeddings from a text recognition Seq2Seq system. · ☆8 · Updated last year
- Repository for the deep-learning framework DIVA-DAF, which is built with historical document image analysis in mind. · ☆18 · Updated 2 weeks ago
- Generic framework for historical document processing · ☆373 · Updated 3 years ago
- ☆32 · Updated 4 years ago
- Master repository which includes most other OCR-D repositories as submodules · ☆72 · Updated last month
- Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription. · ☆35 · Updated 2 years ago
- Toolbox for OCR post-correction · ☆123 · Updated 5 years ago
- Wrapper around pixel classifier · ☆9 · Updated 2 years ago
- An OCR evaluation tool · ☆64 · Updated last month
- Convert PubLayNet data into METS/PAGE-XML · ☆10 · Updated 4 years ago
- ☆10 · Updated 5 years ago
- ☆50 · Updated this week
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support · ☆57 · Updated 3 years ago
- Detect textlines in document images · ☆90 · Updated 5 months ago
- Augment line images for improving OCR datasets · ☆9 · Updated last year