Layout-Parser / platform
☆10Updated 2 years ago
Alternatives and similar repositories for platform:
Users that are interested in platform are comparing it to the libraries listed below
- An OCR evaluation tool☆65Updated last week
- OCR & Ground Truth Resources☆74Updated 2 years ago
- DFKI Layout Detection for OCR-D☆47Updated 3 months ago
- multimodal document analysis☆162Updated 8 months ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆35Updated last year
- A suite of batches and tools for OCR tasks.☆71Updated last year
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆22Updated 4 years ago
- Collection of OCR-related python tools and wrappers from @OCR-D☆126Updated this week
- Seed Machine Translation Data☆30Updated 3 months ago
- PAGE XML format collection for document image page content and more☆67Updated 3 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆54Updated last year
- Master repository which includes most other OCR-D repositories as submodules☆72Updated last week
- Ground Truth Resources for the HTR of patrimonial documents☆40Updated this week
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆175Updated last year
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆67Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated 11 months ago
- TeX compilation service that makes use of arXiv.org's AutoTeX library.☆28Updated 8 months ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated last year
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆205Updated last year
- ☆67Updated 11 months ago
- Conversions between various OCR formats☆74Updated last year
- Streamlit Named Entity Recognition (NER) annotation custom component☆39Updated 2 years ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆22Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆102Updated 5 months ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Updated 3 years ago
- spaCy entry points for Curated Transformers☆26Updated 4 months ago
- 💫 SpaCy wrapper for ConceptNet 💫☆89Updated last year
- In-browser OCR of Ancient Greek and Latin☆26Updated this week
- OCR-D-compliant page segmentation☆67Updated last week