shreyshah97 / Newspaper-SegmentationLinks

Newspaper Segmentation into images and text

☆12

Alternatives and similar repositories for Newspaper-Segmentation

Users that are interested in Newspaper-Segmentation are comparing it to the libraries listed below

Sorting:

githubharald / WordDetectorNN
Detect handwritten words (neural network based).
☆70Updated 3 years ago
divyanshjoshi / Attention-U-Net-Newspaper-Text-Block-Segmentation
Segmenting text blocks and baselines from documents using deep learning techniques
☆13Updated 3 years ago
OCR-D / ocrd_anybaseocr
DFKI Layout Detection for OCR-D
☆47Updated last month
kennethleungty / OCR-Metrics-CER-WER
Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer
☆29Updated 4 years ago
seuretm / printed-vs-handwritten
☆23Updated 5 years ago
ruathudo / post-ocr-correction
☆11Updated 3 years ago
stefan-it / europeana-bert
BERT and ELECTRA models trained on Europeana Newspapers
☆38Updated 3 years ago
TurkuNLP / ocr-correction
Post-processing OCR errors with seq2seq models
☆28Updated 4 years ago
Vishnunkumar / doc_transformers
Document processing using transformers
☆21Updated 2 years ago
cisocrgroup / ocrd_cis
OCR-D python tools
☆33Updated 10 months ago
OCR-D / ocrd_segment
OCR-D-compliant page segmentation
☆67Updated last month
bertsky / ocrd_publaynet
convert PubLayNet data into METS/PAGE-XML
☆10Updated 5 years ago
jarobyte91 / post_ocr_correction
Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"
☆37Updated last year
uniwue-zpd / PAGETools
Small collection of PAGE XML related scripts used at the ZPD Würzburg
☆13Updated 10 months ago
shrutirij / ocr-post-correction
☆139Updated last year
phamquiluan / table-transformer
CVPR 2022: Table Structure Recognition
☆40Updated 3 years ago
natliblux / nautilusocr
METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)
☆53Updated 2 years ago
GarfieldLyu / OCR_POST_DE
OCR post correction for old German corpus
☆19Updated 2 years ago
mauvilsa / imgtxtenh
Tool for enhancing noisy scanned text images
☆48Updated 5 years ago
PD-Mera / ctranslate2-triton-backend
Triton backend for https://github.com/OpenNMT/CTranslate2
☆11Updated 10 months ago
gaxler / dataset_agnostic_segmentation
TensorFlow implementation of a segmentation system for document images.
☆34Updated 6 years ago
impresso / CLEF-HIPE-2020
Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…
☆22Updated 10 months ago
mikahama / natas
Python 3 library for processing historical English
☆67Updated 10 months ago
poke1024 / bbz-segment
Code and data for the paper at http://arxiv.org/abs/2004.07317
☆16Updated 4 years ago
OCR-D / ocrd_all
Master repository which includes most other OCR-D repositories as submodules
☆73Updated this week
swapnil-ahlawat / Document_Layout_Analysis-MonkAI
DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…
☆26Updated 4 years ago
herobd / NAF_dataset
Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription.
☆38Updated 3 years ago
mittagessen / curt
☆13Updated 2 years ago
yang0369 / Information_Extraction
end-to-end information extraction pipeline built by LayoutLMV2, pretrained model from HuggingFace
☆11Updated last year
qurator-spk / sbb_textline_detection
Detect textlines in document images
☆93Updated last year