divyanshjoshi / Attention-U-Net-Newspaper-Text-Block-SegmentationLinks
Segmenting text blocks and baselines from documents using deep learning techniques
☆14Updated 4 years ago
Alternatives and similar repositories for Attention-U-Net-Newspaper-Text-Block-Segmentation
Users that are interested in Attention-U-Net-Newspaper-Text-Block-Segmentation are comparing it to the libraries listed below
Sorting:
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆79Updated 2 years ago
- A neural language modeling toolkit built on PyTorch☆19Updated 2 years ago
- A PyPI package for fast word/character error rate (WER/CER) calculation☆72Updated 2 years ago
- Detect handwritten words (neural network based).☆72Updated 3 years ago
- ☆16Updated 2 years ago
- Handwritten text recognition using transformers.☆158Updated last year
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆37Updated last year
- ☆11Updated 3 years ago
- ☆140Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated 2 years ago
- Official Repository of the Deep Diacritization Paper☆16Updated 4 years ago
- Basic HTR concepts/modules to boost performance☆33Updated 11 months ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆62Updated last year
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 4 years ago
- This repository contains an implementation of the "Representation Learning for Information Extraction from Form-like Documents" paper.☆25Updated 4 years ago
- Pytorch implementation of our paper: Adapting OCR with Limited Labels☆62Updated last year
- Detect textlines in document images☆92Updated last year
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆75Updated last year
- ☆33Updated 5 years ago
- OCR-D-compliant page segmentation☆68Updated last week
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Updated 5 years ago
- Working codes for project☆23Updated 2 years ago
- Newspaper Segmentation into images and text☆12Updated 6 years ago
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆60Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- More than 43+ collections of Thai Natural Language Processing libraries. Update daily.☆30Updated 7 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated last year
- Repository for the deep-learning framework DIVA-DAF which is build with historical document image analysis in mind.☆18Updated last year