miguelballesteros / LSTM-punctuationLinks
☆11Updated 8 years ago
Alternatives and similar repositories for LSTM-punctuation
Users that are interested in LSTM-punctuation are comparing it to the libraries listed below
Sorting:
- ☆51Updated 3 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 6 months ago
- ☆22Updated 3 years ago
- Align word sequences and calculate metrics like word error rate (WER)☆23Updated 13 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Updated 5 years ago
- Convert words to numbers☆21Updated 3 years ago
- Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model☆17Updated 8 years ago
- Links to data used in Sproat & Jaitly (https://arxiv.org/abs/1611.00068) experiments.☆77Updated 4 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Updated 8 years ago
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Updated 6 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Updated 2 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Updated 2 years ago
- RNNs for Text Normalization☆39Updated 7 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated 2 years ago
- ☆10Updated 4 years ago
- Punctuation generation for speech transcripts using lexical and prosodic features☆41Updated 6 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Updated 3 years ago
- The Fisher and CALLHOME Spanish–English Speech Translation Corpus☆40Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆115Updated 6 years ago
- Corpus preprocessing☆99Updated last year
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 4 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated last year
- Efficient Markov Chain word alignment☆52Updated 4 years ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data.☆48Updated 6 years ago
- ☆18Updated 8 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Updated 5 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Updated 5 years ago
- LSTM Language Model with Subword Units Input Representations☆42Updated 4 years ago