harisbinzia / Urdu-Word-Segmentation
Urdu Word Segmentation using Conditional Random Fields (CRFs)
☆12Updated 6 years ago
Alternatives and similar repositories for Urdu-Word-Segmentation:
Users that are interested in Urdu-Word-Segmentation are comparing it to the libraries listed below
- uyghur text resource crawled from website☆12Updated 9 years ago
- Punctuation restoration in ASR text☆32Updated 5 years ago
- Unbounded cache model for online language modeling with open vocabulary☆11Updated 5 years ago
- Demo and samples for universal speech translator☆23Updated 2 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Updated 4 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated 2 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Updated 4 years ago
- A Pytorch based LSTM Punctuation Restoration Implementation/A Simple Tutorial for Leaning Pytorch and NLP☆24Updated 4 years ago
- Aligned bilingual word vectors for English and Chinese☆11Updated 6 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques☆35Updated 6 years ago
- repo for Tibetan corpora☆21Updated last year
- Multilingual Neural Machine Translation using Transformers with Conditional Normalization.☆18Updated last year
- CRFs based Chinese word segmentor☆19Updated 10 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 6 years ago
- The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts.☆42Updated 5 years ago
- Filter dialog data with a simple entropy-based method (see ACL paper)☆14Updated 5 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Updated 3 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 4 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Updated 5 years ago
- RNNs for Text Normalization☆38Updated 7 years ago
- This repository is used to publish our codes for the conference paper "Vietnamese punctuation prediction using deep neural networks" at S…☆10Updated 4 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 4 years ago
- TTS前,文本标准化,将数字字母处理转化为汉字☆12Updated 9 months ago
- Grammatical Error Correction Based on Language Model(BERT, GPT-2), and Seq2Seq☆18Updated 5 years ago
- ChineseWord correct!!when you input some error words,return some maybe right word☆8Updated 10 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- 📄Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset☆21Updated 2 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆51Updated 4 years ago