(ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper
☆88May 25, 2023Updated 2 years ago
Alternatives and similar repositories for docExtractor
Users that are interested in docExtractor are comparing it to the libraries listed below
Sorting:
- (NeurIPS 2020 oral) Code for "Deep Transformation-Invariant Clustering" paper☆77Oct 31, 2022Updated 3 years ago
- (ICCV 2021) Code for "Unsupervised Layered Image Decomposition into Object Prototypes" paper☆46Feb 1, 2023Updated 3 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 5 years ago
- Repository for the deep-learning framework DIVA-DAF which is build with historical document image analysis in mind.☆18Nov 7, 2024Updated last year
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 2 months ago
- The Learnable Typewriter: A Generative Approach to Text Line Analysis☆34Oct 31, 2024Updated last year
- Ground Truth Resources for the HTR of patrimonial documents☆47Feb 15, 2026Updated last week
- (ECCV 2020 Spotlight) Pixel-Pair Occlusion Relationship Map (P2ORM): Formulation, Inference & Application☆46Sep 2, 2022Updated 3 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- ☆66Feb 3, 2026Updated 3 weeks ago
- (CVPRW 2022) Learning Co-segmentation by Segment Swapping for Retrieval and Discovery☆53Oct 27, 2022Updated 3 years ago
- Library in C++ and a python wrapper for dealing with Page XML files☆13Apr 25, 2025Updated 10 months ago
- (ECCV 2020) PyTorch implementation of paper "Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild"☆57Dec 21, 2020Updated 5 years ago
- Tools for TICCL☆14Dec 12, 2025Updated 2 months ago
- Django web application to display, annotate, and export digitized books.☆33Feb 17, 2026Updated last week
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Feb 6, 2026Updated 3 weeks ago
- OCR post correction for old German corpus☆19Aug 29, 2022Updated 3 years ago
- Repository hosting the common code for the entity-fishing clients☆10Jun 10, 2025Updated 8 months ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Oct 9, 2018Updated 7 years ago
- ☆10Mar 16, 2023Updated 2 years ago
- DFKI Layout Detection for OCR-D☆47May 1, 2025Updated 9 months ago
- A CLI tool that generates IIIF Presentation 2.1 Manifests from METS/MODS☆24Apr 17, 2025Updated 10 months ago
- dhSegment on pytorch☆35Jun 12, 2023Updated 2 years ago
- convert NDNP data to IIIF☆12Jun 7, 2016Updated 9 years ago
- OCR-D python tools☆33Aug 16, 2024Updated last year
- (ECCV 2022) Code for Share With Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency☆166Dec 15, 2022Updated 3 years ago
- ☆11Jun 13, 2025Updated 8 months ago
- A repository for online OCRD training infrastructure.☆13Aug 20, 2020Updated 5 years ago
- A Toolkit to Generate Structured Historical Documents☆15Jun 27, 2020Updated 5 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Jan 13, 2024Updated 2 years ago
- VIKUS IIIF Generator☆17Oct 28, 2025Updated 3 months ago
- Repository of the back end implementation of DivaServices☆14May 3, 2019Updated 6 years ago
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated last year
- Digital Resource for and Database of Paleography, Manuscripts and Diplomatic☆65Oct 13, 2025Updated 4 months ago
- (3DV 2021 oral) PyTorch implementation of paper "PoseContrast: Class-Agnostic Object Viewpoint Estimation in the Wild with Pose-Aware Con…☆45Dec 18, 2023Updated 2 years ago
- Named entity annotation tool☆28Jul 6, 2023Updated 2 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Dec 31, 2020Updated 5 years ago
- OCR-D-compliant page segmentation☆68Nov 19, 2025Updated 3 months ago
- View HOCR files with Mirador☆29Sep 27, 2017Updated 8 years ago