ShigekiKarita / espnet-semi-supervisedView external linksLinks
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tree/karita-asrtts for newer code in ICASSP2019 Semi-supervised End-to-end Speech Recognition Using Text-to-speech and Autoencoders
☆38Feb 13, 2020Updated 6 years ago
Alternatives and similar repositories for espnet-semi-supervised
Users that are interested in espnet-semi-supervised are comparing it to the libraries listed below
Sorting:
- ☆76Mar 18, 2022Updated 3 years ago
- ☆42Mar 25, 2022Updated 3 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- ☆10Nov 1, 2025Updated 3 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆30Feb 1, 2020Updated 6 years ago
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Nov 12, 2020Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55May 6, 2020Updated 5 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Oct 24, 2019Updated 6 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- An online speech recognition extension toolkit of Kaldi☆56Jun 23, 2021Updated 4 years ago
- PyTorch CTC Decoder bindings☆14Nov 2, 2017Updated 8 years ago
- ☆276Jan 15, 2021Updated 5 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- FVN is now obsolete. Please use CAPRICEP instead. I will stop updating this tool. Frequency domain variants of Velvet Noise, a flexible b…☆38Aug 12, 2020Updated 5 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Jun 1, 2018Updated 7 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Nov 14, 2020Updated 5 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆32Jun 27, 2019Updated 6 years ago
- End-to-End Attention-Based Large Vocabulary Speech Recognition☆265Nov 22, 2022Updated 3 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch☆58May 3, 2020Updated 5 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆189Jan 29, 2020Updated 6 years ago
- ☆10Dec 16, 2018Updated 7 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo☆19Oct 8, 2020Updated 5 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Jun 24, 2019Updated 6 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆212May 30, 2025Updated 8 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Jan 5, 2026Updated last month
- experiments with RETURNN☆161Feb 7, 2026Updated last week