HaoranMiao / streaming-attentionView external linksLinks
streaming attention networks for end-to-end automatic speech recognition
☆55May 6, 2020Updated 5 years ago
Alternatives and similar repositories for streaming-attention
Users that are interested in streaming-attention are comparing it to the libraries listed below
Sorting:
- ☆26Apr 21, 2021Updated 4 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- ☆276Jan 15, 2021Updated 5 years ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Apr 11, 2021Updated 4 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)☆81Apr 2, 2018Updated 7 years ago
- Example implementation of Monotonic Chunkwise Attention.☆53Feb 23, 2018Updated 7 years ago
- Pytorch Implementation of WaveNODE☆64Sep 4, 2020Updated 5 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆149Aug 25, 2023Updated 2 years ago
- PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)☆67Dec 28, 2020Updated 5 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 2 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Jan 14, 2021Updated 5 years ago
- a pytorch implementation of Google GEDLoss☆32Dec 9, 2020Updated 5 years ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆344Dec 25, 2020Updated 5 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- ☆48Jan 8, 2021Updated 5 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- A python implementation of the Griffin Lim Algorithm for audio reconstruction from magnitudes☆34Jan 17, 2024Updated 2 years ago
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Oct 12, 2019Updated 6 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Towards hot directions in industrial end to end speech recognition☆331Nov 30, 2021Updated 4 years ago
- Python wrapper for kaldi's arpa2fst☆37Aug 27, 2025Updated 5 months ago
- An extended TSP (Time Stretched Pulse). CAPRICEP substantially replaces FVN. CAPRICEP enables interactive and real-time measurement of th…☆29Nov 2, 2023Updated 2 years ago
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆20Nov 15, 2022Updated 3 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Mar 17, 2023Updated 2 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- Wavenet Autoencoder for Unsupervised speech representation learning (after Chorowski, Jan 2019)☆176Sep 16, 2020Updated 5 years ago
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.☆87Aug 17, 2020Updated 5 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Oct 27, 2020Updated 5 years ago
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 2 months ago
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids"☆48Jun 3, 2020Updated 5 years ago