ottokart / punctuator2
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
☆672Updated 3 years ago
Alternatives and similar repositories for punctuator2:
Users that are interested in punctuator2 are comparing it to the libraries listed below
- Punctuation restoration and spell correction experiments.☆251Updated 3 years ago
- Experimental project to punctuate text using a embedding layer, single convolutional layer and output softmax layer written in Keras.☆83Updated 4 years ago
- A sentence segmenter that actually works!☆303Updated 4 years ago
- a pytorch implementation of auto-punctuation learned character by character☆141Updated 4 years ago
- An LSTM RNN for restoring missing punctuation in unsegmented text.☆79Updated 8 years ago
- G2P with Tensorflow☆669Updated 6 months ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆206Updated 6 months ago
- g2p: English Grapheme To Phoneme Conversion☆835Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆535Updated 3 years ago
- A collection of links and notes on forced alignment tools☆889Updated 3 years ago
- Phonetisaurus G2P☆460Updated 8 months ago
- Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).☆274Updated last year
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆180Updated 5 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- A Python wrapper for Kaldi☆1,006Updated 3 weeks ago
- Python interface for forced audio alignment using HTK and SoX☆334Updated 4 years ago
- Open tools and data for cloudless automatic speech recognition☆447Updated 3 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆467Updated 4 years ago
- Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.☆224Updated 5 years ago
- Text and Punctuation correction with Deep Learning☆128Updated 4 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,194Updated 4 months ago
- Formerly known as code.google.com/p/1-billion-word-language-modeling-benchmark☆444Updated 8 years ago
- Punctuation generation for speech transcripts using lexical and prosodic features☆41Updated 5 years ago
- CMU Wilderness Multilingual Speech Dataset☆274Updated 5 years ago
- Command line utility for forced alignment using Kaldi☆1,397Updated 2 months ago
- MIT Language Modeling Toolkit☆116Updated 5 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.☆1,861Updated 2 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆943Updated 5 months ago