BobLd / simple-docstrum
A step-by-step C# implementation of the Docstrum algorithm
☆23 Updated 4 years ago
Alternatives and similar repositories for simple-docstrum
Users who are interested in simple-docstrum are comparing it to the libraries listed below.
- Tools for extracting figures, tables, text, etc. from a PDF document. ☆32 Updated 4 years ago
- PAGE XML format collection for document image page content and more ☆67 Updated 3 years ago
- DFKI Layout Detection for OCR-D ☆47 Updated last month
- ☆69 Updated 7 years ago
- OCR & Ground Truth Resources ☆75 Updated 3 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆186 Updated last week
- Master repository which includes most other OCR-D repositories as submodules ☆73 Updated 2 weeks ago
- convert PubLayNet data into METS/PAGE-XML ☆10 Updated 5 years ago
- Detect textlines in document images ☆93 Updated last year
- Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts ☆22 Updated 6 years ago
- OCR-D python tools ☆33 Updated 9 months ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL) ☆53 Updated 2 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆188 Updated 2 weeks ago
- Conversions between various OCR formats ☆77 Updated 2 years ago
- Detectron2 for Document Layout Analysis ☆187 Updated 10 months ago
- OCR-D-compliant page segmentation ☆67 Updated 3 weeks ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆211 Updated last year
- PDF to XML ALTO file converter ☆240 Updated last week
- AI_DocumentLayoutAnalysis ☆39 Updated 4 years ago
- Document Layout Analysis ☆376 Updated 3 weeks ago
- Page to PAGE Layout Analysis Tool ☆191 Updated 3 years ago
- Document Understanding tools ☆21 Updated 3 years ago
- Framework for information extraction from tables ☆41 Updated 6 years ago
- Toolbox for OCR post-correction ☆121 Updated 5 years ago
- Collection of OCR-related python tools and wrappers from @OCR-D ☆128 Updated last week
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆67 Updated 4 years ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support ☆57 Updated 4 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg ☆13 Updated 10 months ago
- A suite of batches and tools for OCR tasks. ☆71 Updated 2 years ago
- Layout analysis to find layout elements in documents (similar to P2PaLA) ☆19 Updated this week