This repository contains the implementation of DANIEL (a fast Document Attention Network for Information Extraction and Labeling of handwritten documents).
☆20Jan 12, 2026Updated last month
Alternatives and similar repositories for daniel
Users who are interested in daniel are comparing it to the repositories listed below.
- ☆16Feb 16, 2023Updated 3 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 2 months ago
- The repository provides access to the source code for Transcription Pearl, a Handwritten Text Recognition (HTR) tool that uses AI to tr…☆53Apr 29, 2025Updated 9 months ago
- ☆11Jun 13, 2025Updated 8 months ago
- An extensible viewer for OCR-D mets.xml files☆22May 30, 2024Updated last year
- tesseractXplore: an easy-to-use Tesseract GUI with full control☆28Nov 10, 2021Updated 4 years ago
- Official PyTorch implementation of PyramidTabNet: Transformer-based Table Recognition in Image-based Documents☆28Oct 5, 2024Updated last year
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Jan 11, 2023Updated 3 years ago
- ☆28Jul 17, 2019Updated 6 years ago
- Named entity annotation tool☆28Jul 6, 2023Updated 2 years ago
- Basic HTR concepts/modules to boost performance☆39Nov 30, 2024Updated last year
- Page-wise text recognition with lower-supervision line data models☆51Feb 21, 2026Updated last week
- dhSegment on pytorch☆35Jun 12, 2023Updated 2 years ago
- Transcriptions of primers (19th century)☆11Oct 31, 2025Updated 3 months ago
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
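- Usage instructions and rules for the NAS of the National Central University Computer Science and Information Engineering student association☆14Aug 20, 2025Updated 6 months ago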
- ☆11Sep 17, 2021Updated 4 years ago
- Miqra According to the Masorah in two JSON formats☆12Feb 13, 2026Updated 2 weeks ago
- Package that compiles the microsoft dxgkrnl driver from WSL Kernel for using partitioned GPUs from hyperV☆18Jun 29, 2024Updated last year
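- Western scholars generally approach Chinese characters through their components; this repository provides detailed documentation and a database of Chinese character component decomposition.☆11Jul 20, 2023Updated 2 years ago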
- ☆14Jun 5, 2020Updated 5 years ago
- Notes and information for building the WSL-Kernel module and setting up GPU-PV in Linux guests.☆15Aug 22, 2025Updated 6 months ago
- A lexical normalizer for historical spelling variants using a transformer architecture.☆10Mar 12, 2025Updated 11 months ago
- Ground Truth Resources for the HTR of patrimonial documents☆47Feb 15, 2026Updated last week
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- ☆12Updated this week
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- GloSAT Historical Measurement Table Dataset☆11Dec 3, 2025Updated 2 months ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition☆10Apr 8, 2024Updated last year
- Original GOKb repo - Moving to https://github.com/openlibraryenvironment/gokb☆11Jan 23, 2018Updated 8 years ago
- ☆11Mar 10, 2018Updated 7 years ago
- Tools for normalizing the use of some characters and checking file consistencies☆11Jan 12, 2026Updated last month
- Digital texts in Prakrit☆10Sep 14, 2025Updated 5 months ago
- version 4.x of the Princeton Geniza Project☆12Feb 18, 2026Updated last week
- [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…☆75Dec 22, 2025Updated 2 months ago
- ☆10Aug 5, 2019Updated 6 years ago
- Use any vision LLMs to perform OCR using LangChain☆18Jul 29, 2025Updated 6 months ago
- OCR-D compliant toolset for optical layout recognition on historical German-language documents published in Brazil☆11Sep 24, 2021Updated 4 years ago