ChaosCY / LAS-asr
This is the TensorFlow implementation of the Google LAS model.
☆14Updated 6 years ago
Alternatives and similar repositories for LAS-asr
Users who are interested in LAS-asr are comparing it to the libraries listed below:
- ☆45Updated 6 years ago
- transformer for ASR-system (via tensorflow2.0)☆114Updated 6 years ago
- A Neural Machine Translation toolkit for research purposes☆82Updated 7 months ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Updated 4 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Updated 6 years ago
- experiments with RETURNN☆160Updated last month
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆122Updated 5 years ago
- ☆99Updated 7 years ago
- ☆55Updated 5 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Updated 5 years ago
- Example implementation of Monotonic Chunkwise Attention.☆52Updated 7 years ago
- Feedforward Sequential Memory Networks (FSMN) implemented in TensorFlow☆52Updated 8 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆77Updated 4 years ago
- The repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"☆38Updated 5 years ago
- mWER loss implementation in tensorflow☆31Updated 4 years ago
- Speech2vec pre-trained word vectors☆76Updated 6 years ago
- A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition☆237Updated 5 years ago
- CTC end-to-end ASR for TIMIT and 863 corpus.☆220Updated 5 years ago
- Seq2Seq Speech Recognition with Transformer on Mandarin Chinese☆116Updated 5 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 7 years ago
- LSTM CTC End2End Speech Recognition.☆38Updated 6 years ago
- a standalone pitch extractor☆13Updated 7 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- Recurrent Neural Aligner☆50Updated 5 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆79Updated 3 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Updated 4 years ago
- PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)☆81Updated 7 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆174Updated 5 years ago
- Implementations of CTC, Speech-Transformer and CIF for end-to-end speech recognition with PyTorch☆23Updated 5 years ago
- An LDA/PLDA estimator using KALDI in python for speaker verification tasks☆100Updated 8 years ago