ottokart / punctuator2
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
☆669Updated 3 years ago
Alternatives and similar repositories for punctuator2:
Users that are interested in punctuator2 are comparing it to the libraries listed below
- Experimental project to punctuate text using a embedding layer, single convolutional layer and output softmax layer written in Keras.☆83Updated 4 years ago
- Punctuation restoration and spell correction experiments.☆250Updated 3 years ago
- A sentence segmenter that actually works!☆303Updated 4 years ago
- g2p: English Grapheme To Phoneme Conversion☆835Updated 2 years ago
- G2P with Tensorflow☆669Updated 6 months ago
- An LSTM RNN for restoring missing punctuation in unsegmented text.☆79Updated 8 years ago
- Phonetisaurus G2P☆459Updated 7 months ago
- a pytorch implementation of auto-punctuation learned character by character☆141Updated 4 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆205Updated 6 months ago
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆180Updated 5 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆468Updated 4 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆535Updated 2 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,193Updated 3 months ago
- Python interface for forced audio alignment using HTK and SoX☆334Updated 4 years ago
- A collection of links and notes on forced alignment tools☆884Updated 3 years ago
- A Python wrapper for Kaldi☆1,005Updated last week
- DeepSpeech based forced alignment tool☆235Updated 4 years ago
- Evaluate your speech-to-text system with similarity measures such as word error rate (WER)☆674Updated 2 months ago
- Open tools and data for cloudless automatic speech recognition☆447Updated 3 years ago
- Simple, fast unsupervised word aligner☆743Updated 2 years ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- CMU Wilderness Multilingual Speech Dataset☆273Updated 5 years ago
- Text and Punctuation correction with Deep Learning☆128Updated 4 years ago
- Punctuation generation for speech transcripts using lexical and prosodic features☆41Updated 5 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆375Updated last year
- A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunk…☆233Updated 6 years ago
- Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).☆274Updated last year
- Speech-to-text based on wav2letter built for transfer learning☆97Updated 2 years ago
- MIT Language Modeling Toolkit☆116Updated 5 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆440Updated 10 months ago