rishikksh20 / LightSpeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
☆80Updated 3 years ago
Related projects: ⓘ
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆115Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆117Updated 2 years ago
- Implementation of the AlignTTS☆76Updated last year
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Updated 3 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆72Updated 3 years ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆105Updated 2 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆101Updated 3 years ago
- Efficient neural speech synthesis☆80Updated 3 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆89Updated 2 years ago
- Onnx wrapper for espnet infrernce model☆152Updated 2 months ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆133Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆115Updated 2 years ago
- Alignment files of LibriTTS.☆57Updated 4 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆188Updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆126Updated 9 months ago
- MOS score prediction by fine-tuned wav2vec2.0 model☆135Updated last year
- ☆110Updated 2 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆151Updated 2 years ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆187Updated last year
- Official PyTorch implementation of Speaker Conditional WaveRNN☆109Updated 2 years ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆92Updated last year
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆171Updated last week
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆112Updated 2 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆154Updated 3 years ago
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 2 years ago
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"☆95Updated 2 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆126Updated 4 months ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆52Updated 2 years ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆117Updated last year
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆108Updated 3 months ago