Layout-Parser / platform
☆10 Updated 3 years ago
Alternatives and similar repositories for platform
Users who are interested in platform are comparing it to the libraries listed below.
- Incorporating VIsual LAyout Structures for Scientific Text Classification ☆179 Updated 2 years ago
- multimodal document analysis ☆166 Updated 2 months ago
- OCR & Ground Truth Resources ☆76 Updated 3 years ago
- Master repository which includes most other OCR-D repositories as submodules ☆72 Updated 7 months ago
- Document Layout Analysis ☆395 Updated this week
- DFKI Layout Detection for OCR-D ☆47 Updated 9 months ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆218 Updated 2 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models" ☆38 Updated 2 years ago
- PAGE XML format collection for document image page content and more ☆69 Updated 3 weeks ago
- A deep learning toolkit specialized for handwritten document analysis ☆252 Updated 3 months ago
- Software that makes labeling PDFs easy. ☆426 Updated last year
- Collection of OCR-related python tools and wrappers from @OCR-D ☆133 Updated this week
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆69 Updated 5 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL) ☆56 Updated 2 years ago
- OCR-D-compliant page segmentation ☆68 Updated 2 months ago
- Libraries, Archives and Museums (LAM) ☆88 Updated 3 years ago
- Streamlit Named Entity Recognition (NER) annotation custom component ☆39 Updated 3 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces ☆39 Updated 9 months ago
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf. ☆106 Updated 5 years ago
- ☆141 Updated last year
- Run ONNX and TensorFlow inference in the browser. ☆75 Updated 3 years ago
- An OCR evaluation tool ☆68 Updated 5 months ago
- Logical structure analysis for visually structured documents ☆93 Updated 3 years ago
- An index of PDF-centric corpora ☆161 Updated 7 months ago
- In-browser OCR of Ancient Greek and Latin ☆26 Updated last month
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset ☆50 Updated 2 years ago
- Document Image Binarization ☆79 Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles. ☆76 Updated 4 years ago
- dhSegment on pytorch ☆35 Updated 2 years ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader) ☆199 Updated 8 months ago