divyanshjoshi / Attention-U-Net-Newspaper-Text-Block-SegmentationLinks
Segmenting text blocks and baselines from documents using deep learning techniques
☆14Updated 4 years ago
Alternatives and similar repositories for Attention-U-Net-Newspaper-Text-Block-Segmentation
Users that are interested in Attention-U-Net-Newspaper-Text-Block-Segmentation are comparing it to the libraries listed below
Sorting:
- A neural language modeling toolkit built on PyTorch☆19Updated 2 years ago
- ☆18Updated last year
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆76Updated last year
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆79Updated 2 years ago
- Vaksanca introduces free Sanskrit speech corpus with vowel segmentation.☆16Updated 4 years ago
- A PyPI package for fast word/character error rate (WER/CER) calculation☆71Updated 2 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated last year
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆18Updated 4 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆63Updated last year
- ☆141Updated last year
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆17Updated 2 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- This repository contains an implementation of the "Representation Learning for Information Extraction from Form-like Documents" paper.☆25Updated 4 years ago
- Text to Speech for Indic languages☆52Updated 3 years ago
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆62Updated last year
- Generate text boxes from input words with a GAN.☆66Updated 3 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Updated 2 years ago
- ☆49Updated 3 years ago
- More than 43+ collections of Thai Natural Language Processing libraries. Update daily.☆32Updated 7 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 5 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 4 years ago
- ☆17Updated 5 years ago
- Basic HTR concepts/modules to boost performance☆37Updated last year
- ☆17Updated 4 years ago
- I'm building an end-to-end Vietnamese Speech Recognition System. I'll deploy it into production with the help of Flask, Uwsgi, Nginx, and…☆17Updated 3 years ago
- ☆45Updated 3 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- Detect handwritten words (neural network based).☆73Updated 3 years ago