BobLd / simple-docstrum
A step-by-step C# implementation of the Docstrum algorithm
☆23 Updated 4 years ago
Alternatives and similar repositories for simple-docstrum:
Users who are interested in simple-docstrum are comparing it to the libraries listed below.
- Tools for extracting figures, tables, text, etc. from a PDF document. ☆32 Updated 4 years ago
- ☆69 Updated 6 years ago
- PAGE XML format collection for document image page content and more ☆67 Updated 3 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆205 Updated last year
- DFKI Layout Detection for OCR-D ☆47 Updated 3 months ago
- Document Layout Analysis ☆359 Updated 3 weeks ago
- OCR & Ground Truth Resources ☆74 Updated 2 years ago
- PDF to XML ALTO file converter ☆223 Updated last month
- Detect textlines in document images ☆91 Updated 8 months ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆181 Updated 2 months ago
- Toolbox for OCR post-correction ☆122 Updated 5 years ago
- A tool for extracting arbitrary tables from untagged PDF documents ☆38 Updated 4 years ago
- Conversions between various OCR formats ☆74 Updated last year
- AI_DocumentLayoutAnalysis ☆38 Updated 4 years ago
- OCR-D python tools ☆33 Updated 6 months ago
- The hOCR Embedded OCR Workflow and Output Format ☆74 Updated 6 months ago
- ☆77 Updated 2 years ago
- A simple document layout analysis using Python-OpenCV ☆124 Updated 4 years ago
- Collection of OCR-related python tools and wrappers from @OCR-D ☆125 Updated this week
- Document Image Binarization ☆76 Updated 3 months ago
- Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts ☆22 Updated 5 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL) ☆54 Updated last year
- The CIS OCR PostCorrectionTool ☆41 Updated 2 years ago
- Detectron2 for Document Layout Analysis ☆185 Updated 6 months ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆185 Updated last week
- Simple docker deployment of document layout analysis using detectron2 ☆19 Updated 3 years ago
- ☆129 Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection ☆102 Updated 5 months ago
- Page to PAGE Layout Analysis Tool ☆191 Updated 3 years ago
- Master repository which includes most other OCR-D repositories as submodules ☆72 Updated this week