ottokart / punctuator2Links
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
☆684Updated 4 years ago
Alternatives and similar repositories for punctuator2
Users that are interested in punctuator2 are comparing it to the libraries listed below
Sorting:
- Experimental project to punctuate text using a embedding layer, single convolutional layer and output softmax layer written in Keras.☆83Updated 5 years ago
- G2P with Tensorflow☆680Updated last year
- A sentence segmenter that actually works!☆304Updated 5 years ago
- Punctuation restoration and spell correction experiments.☆252Updated 4 years ago
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)☆795Updated last month
- g2p: English Grapheme To Phoneme Conversion☆911Updated 3 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆314Updated 4 years ago
- A collection of links and notes on forced alignment tools☆935Updated 4 years ago
- CMU Wilderness Multilingual Speech Dataset☆291Updated 6 years ago
- Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).☆283Updated 2 years ago
- Simple, fast unsupervised word aligner☆766Updated 3 years ago
- Phonetisaurus G2P☆507Updated last year
- An LSTM RNN for restoring missing punctuation in unsegmented text.☆78Updated 9 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆537Updated 4 years ago
- a pytorch implementation of auto-punctuation learned character by character☆141Updated 5 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆227Updated last year
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆396Updated 4 years ago
- A Python wrapper for Kaldi☆1,030Updated 2 months ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,222Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆481Updated 5 years ago
- Simple text to phones converter for multiple languages☆1,511Updated last year
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆181Updated 6 years ago
- Bitextor generates translation memories from multilingual websites☆300Updated last year
- Open tools and data for cloudless automatic speech recognition☆446Updated 4 years ago
- Python interface for forced audio alignment using HTK and SoX☆350Updated 5 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,384Updated last year
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆942Updated last year
- GStreamer plugin around Kaldi's online neural network decoder☆184Updated 5 years ago
- DeepSpeech based forced alignment tool☆239Updated 5 years ago
- Speaker diarization scripts, based on AaltoASR☆191Updated 7 years ago