divyanshjoshi / Attention-U-Net-Newspaper-Text-Block-SegmentationLinks
Segmenting text blocks and baselines from documents using deep learning techniques
☆14Updated 4 years ago
Alternatives and similar repositories for Attention-U-Net-Newspaper-Text-Block-Segmentation
Users that are interested in Attention-U-Net-Newspaper-Text-Block-Segmentation are comparing it to the libraries listed below
Sorting:
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆37Updated last year
- A neural language modeling toolkit built on PyTorch☆18Updated 2 years ago
- A PyPI package for fast word/character error rate (WER/CER) calculation☆72Updated 2 years ago
- Detect handwritten words (neural network based).☆71Updated 3 years ago
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆56Updated 10 months ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- Handwritten text recognition using transformers.☆157Updated last year
- Official Repository of the Deep Diacritization Paper☆16Updated 4 years ago
- ☆16Updated 2 years ago
- I'm building an end-to-end Vietnamese Speech Recognition System. I'll deploy it into production with the help of Flask, Uwsgi, Nginx, and…☆17Updated 2 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- ☆139Updated last year
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆79Updated 2 years ago
- ☆52Updated last month
- This repository contains an implementation of the "Representation Learning for Information Extraction from Form-like Documents" paper.☆25Updated 4 years ago
- ☆43Updated 2 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 5 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- DFKI Layout Detection for OCR-D☆47Updated 3 months ago
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.☆11Updated 6 months ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Updated 5 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 4 years ago
- Pytorch implementation of our paper: Adapting OCR with Limited Labels☆63Updated last year
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆19Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- ☆17Updated 4 years ago
- This is the github repository for converting craft pretrained model to tflite version☆35Updated 4 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆61Updated 11 months ago
- Repository for the deep-learning framework DIVA-DAF which is build with historical document image analysis in mind.☆18Updated 9 months ago