OCR-D / ocrd_segment
OCR-D-compliant page segmentation
☆66 · Updated 2 weeks ago
Related projects:
- Detect textlines in document images ☆88 · Updated 3 months ago
- DFKI Layout Detection for OCR-D ☆48 · Updated 4 months ago
- OCR-D python tools ☆33 · Updated last month
- Document Image Binarization ☆71 · Updated 3 weeks ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces ☆38 · Updated 3 weeks ago
- Master repository which includes most other OCR-D repositories as submodules ☆71 · Updated 3 weeks ago
- OCR & Ground Truth Resources ☆75 · Updated 2 years ago
- Pretrained mixed models to be used with Calamari. ☆55 · Updated 3 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆179 · Updated last month
- Page to PAGE Layout Analysis Tool ☆190 · Updated 2 years ago
- AI_DocumentLayoutAnalysis ☆38 · Updated 3 years ago
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation ☆114 · Updated 9 months ago
- An OCR evaluation tool ☆61 · Updated 2 weeks ago
- convert PubLayNet data into METS/PAGE-XML ☆10 · Updated 4 years ago
- A suite of batches and tools for OCR tasks. ☆71 · Updated last year
- Toolbox for OCR post-correction ☆122 · Updated 5 years ago
- Augment line images for improving OCR datasets ☆9 · Updated 11 months ago
- OCR evaluation brought to you by University of Alicante ☆66 · Updated 2 years ago
- ☆72 · Updated 6 years ago
- A deep learning toolkit specialized for handwritten document analysis ☆202 · Updated 2 weeks ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format ☆44 · Updated 5 months ago
- Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription. ☆34 · Updated 2 years ago
- Collection of OCR-related python tools and wrappers from @OCR-D ☆118 · Updated this week
- ☆20 · Updated 5 years ago
- Detect handwritten words (neural network based). ☆64 · Updated 2 years ago
- ☆135 · Updated last year
- document image degradation ☆155 · Updated 4 years ago
- PAGE XML format collection for document image page content and more ☆62 · Updated 3 years ago
- Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support ☆55 · Updated 3 years ago
- ☆16 · Updated 2 years ago