divyanshjoshi / Attention-U-Net-Newspaper-Text-Block-Segmentation
Segmenting text blocks and baselines from documents using deep learning techniques
☆12Updated 3 years ago
Alternatives and similar repositories for Attention-U-Net-Newspaper-Text-Block-Segmentation:
Users that are interested in Attention-U-Net-Newspaper-Text-Block-Segmentation are comparing it to the libraries listed below
- Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer☆28Updated 3 years ago
- ☆80Updated last year
- convert PubLayNet data into METS/PAGE-XML☆10Updated 5 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- Implementation of PHOSNet and Pho(SC)Net for Word Recognition in Historical Documents. Implemented using Tensorflow 2.x☆8Updated 2 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆56Updated 6 months ago
- Repository sharing code and the model for the paper "Rescoring Sequence-to-Sequence Models for Text Line Recognition with CTC-Prefixes"☆15Updated 3 years ago
- Detect handwritten words (neural network based).☆68Updated 3 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆79Updated last year
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆70Updated 6 months ago
- ☆72Updated 2 years ago
- ☆15Updated 2 years ago
- Key Information Extraction From Documents: Evaluation And Generator☆20Updated 4 years ago
- ☆21Updated 5 years ago
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆50Updated 5 months ago
- Working codes for project☆23Updated last year
- ☆33Updated 4 years ago
- Handwritten text recognition using transformers.☆157Updated 8 months ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Updated 3 years ago
- RUN LENGTH SMOOTHING ALGORITHM(RLSA) is a method mainly used for block segmentation and text discrimination. It helps to extract the nece…☆28Updated last year
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Updated 3 years ago
- Repository for the deep-learning framework DIVA-DAF which is build with historical document image analysis in mind.☆18Updated 4 months ago
- ☆15Updated 2 years ago
- Pytorch implementation of our paper: Adapting OCR with Limited Labels☆59Updated last year
- Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription.☆37Updated 2 years ago
- ☆15Updated 8 months ago
- TextTron is a simple light-weight image processing based text detector for document images.☆52Updated 4 years ago
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared wi…☆45Updated 8 months ago
- ☆22Updated 2 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆36Updated last year