☆11Jun 13, 2025Updated 9 months ago
Alternatives and similar repositories for DAN
Users that are interested in DAN are comparing it to the libraries listed below
Sorting:
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 3 months ago
- Tools for normalizing the use of some characters and checking file consistencies☆11Jan 12, 2026Updated 2 months ago
- ☆16Feb 16, 2023Updated 3 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- Repository for the deep-learning framework DIVA-DAF which is build with historical document image analysis in mind.☆18Nov 7, 2024Updated last year
- This repository contain the implementation of DANIEL. (A fast Document Attention Network for Information Extraction and Labeling of handw…☆21Jan 12, 2026Updated 2 months ago
- You Actually Look Twice At it☆39Jan 21, 2025Updated last year
- Repository hosting the common code for the entity-fishing clients☆10Jun 10, 2025Updated 9 months ago
- ☆19Oct 1, 2021Updated 4 years ago
- Annotation tool (NER) for XML documents (TEI, EAD) - WIP☆11Jul 22, 2022Updated 3 years ago
- The Learnable Typewriter: A Generative Approach to Text Line Analysis☆34Oct 31, 2024Updated last year
- OCR post correction for old German corpus☆20Aug 29, 2022Updated 3 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- tesseractXplore a tesseract ease of use gui with full control☆28Nov 10, 2021Updated 4 years ago
- Ground Truth Resources for the HTR of patrimonial documents☆47Mar 12, 2026Updated last week
- A Toolkit to Generate Structured Historical Documents☆15Jun 27, 2020Updated 5 years ago
- Named entity annotation tool☆28Jul 6, 2023Updated 2 years ago
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated last year
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆15Jan 20, 2026Updated 2 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- ☆66Mar 12, 2026Updated last week
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated 2 years ago
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- (ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper☆88May 25, 2023Updated 2 years ago
- ☆29Jul 17, 2019Updated 6 years ago
- Page-wise text recognition with lower-supervision line data models☆51Mar 11, 2026Updated last week
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- Combination of the RapidFuzz library with Spacy PhraseMatcher☆11Sep 29, 2021Updated 4 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 4 months ago
- A deep learning toolkit specialized for handwritten document analysis☆253Oct 26, 2025Updated 4 months ago
- ☆48Dec 16, 2022Updated 3 years ago
- Repository sharing code and the model for the paper "Rescoring Sequence-to-Sequence Models for Text Line Recognition with CTC-Prefixes"☆17Oct 13, 2021Updated 4 years ago
- Transcription corpora for training HTR models for medieval manuscripts from the 12th to the 15th century.☆25Jan 17, 2025Updated last year
- Python tools for performing various operations on ALTO XML files☆49Feb 27, 2025Updated last year
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆56May 30, 2023Updated 2 years ago
- Fork of dhSegment for experiments on visual and textual feature combination.☆15Jan 30, 2021Updated 5 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆64Sep 22, 2024Updated last year
- A repository for illustrating the transformation of a PAGE XML file into XML-TEI format, resulting from experimentations made for the LEC…☆17May 18, 2022Updated 3 years ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago