A Toolkit to Generate Structured Historical Documents
☆15Jun 27, 2020Updated 5 years ago
Alternatives and similar repositories for DocEmul
Users who are interested in DocEmul are comparing it to the libraries listed below
- Named Entity Recognition☆19Feb 13, 2026Updated last month
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated last year
- Scene text rectification using glyph and character alignment properties☆22Jan 21, 2018Updated 8 years ago
- Named entity annotation tool☆28Jul 6, 2023Updated 2 years ago
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- ☆11Jun 13, 2025Updated 9 months ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- Agda formalization of the paper, "Higher-Order Functions and Brouwer's Thesis". Deduces a Brouwer ordinal from a function ((nat -> nat) -…☆13Sep 22, 2020Updated 5 years ago
- Code for DNN feature map compression paper☆11Nov 21, 2018Updated 7 years ago
- Offline android OCR app using deep learning☆22Sep 7, 2018Updated 7 years ago
- ☆10Nov 19, 2020Updated 5 years ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF☆10Apr 27, 2020Updated 5 years ago
- The OCRopus OCR System☆11Dec 17, 2014Updated 11 years ago
- A set of awesome content about Data Augmentation for Deep Learning and other stuff!!!☆15Nov 27, 2020Updated 5 years ago
- Parser for Valgrind's massif.out file format.☆20Mar 17, 2013Updated 13 years ago
- An implementation of (some fragment of) cubical type theory using rewrite rules, based on a talk given by Conor McBride at the 23rd Agda'…☆12Aug 5, 2016Updated 9 years ago
- A little JavaScript application that wants to help learning the bandoneon.☆18Updated this week
- Nonlinear SVGD for Learning Diversified Mixture Models☆13Jan 23, 2019Updated 7 years ago
- Transcriptions of primers (19th century)☆11Oct 31, 2025Updated 4 months ago
- Unconstrained Text Detection with Box Supervision and Dynamic Self-Training☆34Nov 24, 2022Updated 3 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 3 months ago
- Detect textlines in document images☆91May 27, 2024Updated last year
- A new lossless data compression algorithm☆12Nov 19, 2025Updated 4 months ago
- ☆13Feb 18, 2020Updated 6 years ago
- Converters for various file formats used for representing OCR☆12Apr 30, 2025Updated 10 months ago
- Printed and handwritten text segmentation using fully convolutional networks and CRF post-processing☆41Jan 14, 2021Updated 5 years ago
- Submission for DIBCO 2017☆16Sep 11, 2017Updated 8 years ago
- Repository collecting all the submodules for the new PyTorch-based OCR System.☆142Feb 22, 2021Updated 5 years ago
- Generic API for dispatch to Pyro backends.☆16Feb 13, 2022Updated 4 years ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- ☆10Aug 5, 2019Updated 6 years ago
- Code for Font Classification Networks☆13Sep 11, 2017Updated 8 years ago
- ☆11Sep 17, 2021Updated 4 years ago
- An OCR evaluation tool☆69Aug 22, 2025Updated 6 months ago
- AI_DocumentLayoutAnalysis☆39Nov 25, 2020Updated 5 years ago
- NLP-helper for OCR-ed pages in PAGE XML format