divyanshjoshi / Attention-U-Net-Newspaper-Text-Block-Segmentation
Segmenting text blocks and baselines from documents using deep learning techniques
☆14 Updated 4 years ago
Alternatives and similar repositories for Attention-U-Net-Newspaper-Text-Block-Segmentation
Users who are interested in Attention-U-Net-Newspaper-Text-Block-Segmentation are comparing it to the repositories listed below:
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION ☆79 Updated 2 years ago
- Detect handwritten words (neural network based). ☆71 Updated 3 years ago
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23) ☆58 Updated last year
- Handwritten text recognition using transformers. ☆157 Updated last year
- A neural language modeling toolkit built on PyTorch ☆19 Updated 2 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models" ☆37 Updated last year
- Attention-based sequence-to-sequence model for handwritten word recognition ☆62 Updated last year
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 4 years ago
- ☆16 Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆24 Updated 4 years ago
- ☆139 Updated last year
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte… ☆219 Updated 9 months ago
- Sequence to sequence model for Arabic punctuation prediction. ☆12 Updated 5 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing" ☆33 Updated 3 years ago
- A PyPI package for fast word/character error rate (WER/CER) calculation ☆72 Updated 2 years ago
- Text to Speech for Indic languages ☆51 Updated 3 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction ☆32 Updated 3 years ago
- Pytorch Implementation of TableNet ☆67 Updated 4 years ago
- ☆17 Updated 4 years ago
- This repository contains an implementation of the "Representation Learning for Information Extraction from Form-like Documents" paper. ☆25 Updated 4 years ago
- ☆11 Updated 3 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 Updated 2 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23) ☆10 Updated 11 months ago
- I'm building an end-to-end Vietnamese Speech Recognition System. I'll deploy it into production with the help of Flask, Uwsgi, Nginx, and… ☆17 Updated 3 years ago
- Official Repository of the Deep Diacritization Paper ☆16 Updated 4 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation ☆73 Updated last year
- Evaluation of the Layoutlm model on the CORD dataset ☆32 Updated 3 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs… ☆32 Updated 4 years ago
- TextTron is a simple light-weight image processing based text detector for document images. ☆53 Updated 4 years ago
- ☆47 Updated 2 years ago