vlarine / wav2vec
vq-wav2vec inference
☆13 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for wav2vec
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network ☆45 · Updated 2 years ago
- TTS Text Analyzer ☆32 · Updated last year
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications ☆62 · Updated last month
- Objective metrics used in several text-to-speech (TTS) papers. ☆46 · Updated 2 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN… ☆74 · Updated 2 years ago
- ☆62 · Updated last year
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech ☆46 · Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆56 · Updated 3 years ago
- PyTorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022) ☆54 · Updated 8 months ago
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo ☆17 · Updated 4 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with PyTorch.