dhSegment on pytorch
☆35Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for dhSegment-torch
Users that are interested in dhSegment-torch are comparing it to the libraries listed below
Sorting:
- ☆14Jul 11, 2022Updated 3 years ago
- Fork of dhSegment for experiments on visual and textual feature combination.☆15Jan 30, 2021Updated 5 years ago
- ☆16Feb 16, 2023Updated 3 years ago
- Generic framework for historical document processing☆382Jul 9, 2021Updated 4 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 2 months ago
- This repository contain the implementation of DANIEL. (A fast Document Attention Network for Information Extraction and Labeling of handw…☆21Jan 12, 2026Updated last month
- ☆10Aug 5, 2019Updated 6 years ago
- Computer Vision and Deep Learning tutorials for the course Foundation of Digital Humanities☆10Dec 6, 2019Updated 6 years ago
- ☆66Feb 3, 2026Updated last month
- uncover old chinese textual parallels based on sound☆15Feb 23, 2026Updated last week
- (ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper☆88May 25, 2023Updated 2 years ago
- Repository for the deep-learning framework DIVA-DAF which is build with historical document image analysis in mind.☆18Nov 7, 2024Updated last year
- Quicksign OCRized Text Dataset (QS-OCR)☆45May 7, 2019Updated 6 years ago
- DFKI Layout Detection for OCR-D☆47May 1, 2025Updated 10 months ago
- ☆24Dec 8, 2022Updated 3 years ago
- ☆28Jul 17, 2019Updated 6 years ago
- Named entity annotation tool☆28Jul 6, 2023Updated 2 years ago
- ☆12Nov 3, 2024Updated last year
- Document Layout Analysis☆398Mar 1, 2026Updated last week
- Page to PAGE Layout Analysis Tool☆191Jan 17, 2022Updated 4 years ago
- Detectron2 for Document Layout Analysis☆187Aug 2, 2024Updated last year
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆33Jul 20, 2022Updated 3 years ago
- A curated list of digital things related to the field of Chinese studies.☆34Sep 4, 2020Updated 5 years ago
- Deep learning based page layout analysis☆195Apr 24, 2019Updated 6 years ago
- Convolutional Neural Network (CNN) was trained on 48x48 pixel grayscale images to predict 5 different emotions from images. Ten different…☆11Sep 21, 2022Updated 3 years ago
- 结合截图生成干净的百度热力图☆17Jun 24, 2023Updated 2 years ago
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"☆10Feb 17, 2021Updated 5 years ago
- ☆11Jun 13, 2025Updated 8 months ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35May 25, 2023Updated 2 years ago
- Documentation and use cases for ALTO XML☆42Sep 10, 2018Updated 7 years ago
- Document Image Binarization☆79Oct 17, 2024Updated last year
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
- (CRNN) Chinese Characters Recognition. add Backbone network resnet18 senet☆10Oct 20, 2021Updated 4 years ago
- ☆11Aug 7, 2022Updated 3 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Dec 13, 2018Updated 7 years ago
- DEPRECATED eXist code for Syriaca.org: The Syriac Reference Portal☆10Jun 1, 2024Updated last year
- Dewey Data Inc. Python API☆14Jul 2, 2025Updated 8 months ago
- ☆11Sep 17, 2021Updated 4 years ago
- C++ code and documentation for the MFlash PKDD'16 publication☆10Oct 25, 2016Updated 9 years ago