harisbinzia / Urdu-Word-Segmentation
Urdu Word Segmentation using Conditional Random Fields (CRFs)
☆12Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for Urdu-Word-Segmentation
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Updated 5 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications☆13Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- Dialect identification using Siamese network☆15Updated 6 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- Unbounded cache model for online language modeling with open vocabulary☆11Updated 5 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 4 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Augmentation scripts for the bAbI Dialog Tasks dataset☆14Updated 6 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow☆21Updated 6 years ago
- RNN model to punctuate degraded text with no punctuation, and an application that combines it with Watson TTS for automated transcription…☆10Updated 7 years ago
- Text normalization scripts from IRISA lab☆12Updated 6 years ago
- Pronounce Arabic words☆18Updated 5 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Updated 4 years ago
- Demo and samples for universal speech translator☆22Updated last year
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Updated 2 years ago
- Multilingual Neural Machine Translation using Transformers with Conditional Normalization.☆18Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…☆24Updated last year
- Code for ACL 2020 paper "Rigid Formats Controlled Text Generation":https://arxiv.org/abs/2004.08022☆11Updated 3 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Updated 4 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Updated 6 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆18Updated last year
- (Si)mply a (Re)search front-end for Text-To-Speech Synthesis.☆10Updated 6 years ago
- Code for NAACL 2019 paper: "Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions"☆16Updated last year
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Updated 7 years ago