Layout-Parser / platform
☆10 Updated 2 years ago
Related projects
Alternatives and complementary repositories for platform
- An OCR evaluation tool ☆64 Updated last month
- OCR & Ground Truth Resources ☆74 Updated 2 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL) ☆52 Updated last year
- A suite of batches and tools for OCR tasks. ☆71 Updated last year
- Incorporating VIsual LAyout Structures for Scientific Text Classification ☆173 Updated last year
- DFKI Layout Detection for OCR-D ☆47 Updated 2 weeks ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models" ☆35 Updated 11 months ago
- Master repository which includes most other OCR-D repositories as submodules ☆72 Updated last month
- Multimodal document analysis ☆160 Updated 5 months ago
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT) ☆22 Updated last year
- Tools for evaluating OCR performance relative to ground truth. ☆10 Updated 10 months ago
- Object Detection Model for Scanned Documents ☆83 Updated last year
- spaCy entry points for Curated Transformers ☆25 Updated last month
- ReadingBank: A Benchmark Dataset for Reading Order Detection ☆91 Updated 2 months ago
- A collection of open source tools and resources related to Wikibase knowledge graphs ☆66 Updated last year
- Logical structure analysis for visually structured documents ☆84 Updated 2 years ago
- Small Python package to measure OCR quality and other related metrics. ☆21 Updated 9 months ago
- Document Image Binarization ☆73 Updated last month
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆62 Updated 8 months ago
- Link raw affiliations to ROR IDs ☆25 Updated last year
- Integrate AI-powered Document Analysis Pipelines ☆62 Updated this week
- PAGE XML format collection for document image page content and more ☆66 Updated 3 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆203 Updated last year
- Seed Machine Translation Data ☆30 Updated last week
- Collection of OCR-related Python tools and wrappers from @OCR-D ☆119 Updated this week
- Index of URLs to PDF files all over the internet and scripts ☆21 Updated last year
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆65 Updated 4 years ago
- Glyph Miner, a system for extracting glyphs from early typeset prints ☆34 Updated 8 years ago