ottokart / punctuator2
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
☆676 Updated 3 years ago
Alternatives and similar repositories for punctuator2:
Users who are interested in punctuator2 are comparing it to the repositories listed below
- Experimental project to punctuate text using an embedding layer, a single convolutional layer, and an output softmax layer, written in Keras. ☆83 Updated 4 years ago
- Punctuation restoration and spell correction experiments. ☆252 Updated 4 years ago
- a pytorch implementation of auto-punctuation learned character by character ☆141 Updated 4 years ago
- A sentence segmenter that actually works! ☆306 Updated 4 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages ☆213 Updated 9 months ago
- An LSTM RNN for restoring missing punctuation in unsegmented text. ☆78 Updated 8 years ago
- g2p: English Grapheme To Phoneme Conversion ☆849 Updated 2 years ago
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet) ☆715 Updated 2 weeks ago
- G2P with Tensorflow ☆673 Updated 9 months ago
- SOTA punctuation restoration (e.g. for automatic speech recognition) deep learning model based on a pre-trained BERT model ☆179 Updated 5 years ago
- Phonetisaurus G2P ☆473 Updated 11 months ago
- Simple, fast unsupervised word aligner ☆752 Updated 2 years ago
- Open-Source Neural Machine Translation in Tensorflow ☆797 Updated 2 years ago
- Speaker diarization scripts, based on AaltoASR ☆190 Updated 6 years ago
- Text and Punctuation correction with Deep Learning ☆128 Updated 5 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) ☆1,213 Updated 7 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/) ☆537 Updated 3 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python. ☆376 Updated last year
- A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunk… ☆234 Updated 6 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences. ☆444 Updated last year
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible. ☆170 Updated 4 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,324 Updated 11 months ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages ☆470 Updated 5 years ago
- Punctuation generation for speech transcripts using lexical and prosodic features ☆41 Updated 6 years ago
- CMU Wilderness Multilingual Speech Dataset ☆279 Updated 6 years ago
- Sentence paraphrase generation at the sentence level ☆407 Updated 2 years ago
- Machine-Translation-based sentence alignment tool for parallel text ☆309 Updated 4 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit ☆943 Updated 8 months ago
- Bitextor generates translation memories from multilingual websites ☆292 Updated 5 months ago
- MIT Language Modeling Toolkit ☆116 Updated 5 years ago