BobLd / simple-docstrum
A step-by-step C# implementation of the Docstrum algorithm
☆23 Updated 3 years ago
Related projects
Alternatives and complementary repositories for simple-docstrum
- Tools for extracting figures, tables, text, etc. from a PDF document. ☆32 Updated 3 years ago
- PAGE XML format collection for document image page content and more ☆66 Updated 3 years ago
- Document Layout Analysis ☆345 Updated this week
- PDF to XML ALTO file converter ☆215 Updated last month
- OCR-D python tools ☆33 Updated 2 months ago
- ☆69 Updated 6 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆180 Updated this week
- DFKI Layout Detection for OCR-D ☆47 Updated last week
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆202 Updated last year
- OCR & Ground Truth Resources ☆74 Updated 2 years ago
- Master repository which includes most other OCR-D repositories as submodules ☆72 Updated 3 weeks ago
- Detect textlines in document images ☆90 Updated 5 months ago
- Toolbox for OCR post-correction ☆123 Updated 5 years ago
- Conversions between various OCR formats ☆71 Updated last year
- OCR-D-compliant page segmentation ☆66 Updated 2 months ago
- convert PubLayNet data into METS/PAGE-XML ☆10 Updated 4 years ago
- Page to PAGE Layout Analysis Tool ☆191 Updated 2 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden… ☆26 Updated 3 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL) ☆52 Updated last year
- ☆36 Updated 4 years ago
- The hOCR Embedded OCR Workflow and Output Format ☆73 Updated 3 months ago
- AI_DocumentLayoutAnalysis ☆38 Updated 3 years ago
- Document Image Binarization ☆73 Updated 3 weeks ago
- Collection of OCR-related python tools and wrappers from @OCR-D ☆119 Updated this week
- ☆74 Updated 2 years ago
- Fork of dhSegment for experiments on visual and textual feature combination. ☆15 Updated 3 years ago
- ☆87 Updated 4 years ago
- A tool for extracting arbitrary tables from untagged PDF documents ☆38 Updated 3 years ago
- A simple document layout analysis using Python-OpenCV ☆123 Updated 4 years ago
- Detectron2 for Document Layout Analysis ☆185 Updated 3 months ago