shreyshah97 / Newspaper-SegmentationLinks
Newspaper Segmentation into images and text
☆12Updated 6 years ago
Alternatives and similar repositories for Newspaper-Segmentation
Users that are interested in Newspaper-Segmentation are comparing it to the libraries listed below
Sorting:
- Detect handwritten words (neural network based).☆70Updated 3 years ago
- Segmenting text blocks and baselines from documents using deep learning techniques☆13Updated 3 years ago
- DFKI Layout Detection for OCR-D☆47Updated last month
- Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer☆29Updated 4 years ago
- ☆23Updated 5 years ago
- ☆11Updated 3 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- Post-processing OCR errors with seq2seq models☆28Updated 4 years ago
- Document processing using transformers☆21Updated 2 years ago
- OCR-D python tools☆33Updated 10 months ago
- OCR-D-compliant page segmentation☆67Updated last month
- convert PubLayNet data into METS/PAGE-XML☆10Updated 5 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆37Updated last year
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆13Updated 10 months ago
- ☆139Updated last year
- CVPR 2022: Table Structure Recognition☆40Updated 3 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated 2 years ago
- OCR post correction for old German corpus☆19Updated 2 years ago
- Tool for enhancing noisy scanned text images☆48Updated 5 years ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆11Updated 10 months ago
- TensorFlow implementation of a segmentation system for document images.☆34Updated 6 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 10 months ago
- Python 3 library for processing historical English☆67Updated 10 months ago
- Code and data for the paper at http://arxiv.org/abs/2004.07317☆16Updated 4 years ago
- Master repository which includes most other OCR-D repositories as submodules☆73Updated this week
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 4 years ago
- Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription.☆38Updated 3 years ago
- ☆13Updated 2 years ago
- end-to-end information extraction pipeline built by LayoutLMV2, pretrained model from HuggingFace☆11Updated last year
- Detect textlines in document images☆93Updated last year