viswavi / languageid
Identifying the language of input text using character-level n-grams, with support for 45 languages
☆11Updated 2 years ago
Alternatives and similar repositories for languageid:
Users that are interested in languageid are comparing it to the libraries listed below
- Speech2vec pre-trained word vectors☆76Updated 6 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 5 years ago
- Fast parallel CTC.☆31Updated 6 years ago
- Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain☆60Updated 5 years ago
- Punctuation generation for speech transcripts using lexical and prosodic features☆41Updated 6 years ago
- LSTM Language Model with Subword Units Input Representations☆42Updated 3 years ago
- An LSTM RNN for restoring missing punctuation in unsegmented text.☆79Updated 8 years ago
- Python library for n-gram models in ARPA format☆40Updated 2 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- English text corrector by using deep neural networks in Pytorch☆47Updated 7 years ago
- Tensorflow with KenLM integrated for beam search scoring☆34Updated 7 years ago
- Support tools for punctuation and boundary detection for ASR output.☆57Updated 2 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 6 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 4 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆49Updated 5 months ago
- Python binding for SRI Language Modeling Toolkit implemented in Cython☆29Updated 3 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆73Updated last year
- An extremely simple Python wrapper for the SRI Language Modeling toolkit☆70Updated 10 years ago
- ☆45Updated 5 years ago
- Levenshtein edit-distance on PyTorch and CUDA☆94Updated 2 years ago
- ☆74Updated 3 years ago
- SWIG Wrapper for the SRILM toolkit☆33Updated 4 years ago
- ☆21Updated 5 years ago
- PyTorch bindings for Warp-CTC☆42Updated 5 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 7 years ago
- RNNs for Text Normalization☆38Updated 7 years ago
- PyTorch CTC Decoder bindings☆42Updated 7 years ago
- eXtensible Neural Machine Translation☆185Updated 5 years ago
- PyTorch CTC Decoder bindings☆14Updated 7 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 6 months ago