lucidrains / NWT-pytorchView external linksLinks
Implementation of NWT, audio-to-video generation, in Pytorch
☆92Mar 17, 2022Updated 3 years ago
Alternatives and similar repositories for NWT-pytorch
Users that are interested in NWT-pytorch are comparing it to the libraries listed below
Sorting:
- ☆10Apr 8, 2024Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- Local Attention - Flax module for Jax☆22May 26, 2021Updated 4 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Aug 3, 2021Updated 4 years ago
- Temporary anonymous version☆22Mar 20, 2024Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Sep 10, 2021Updated 4 years ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Dec 10, 2020Updated 5 years ago
- An implementation of (Induced) Set Attention Block, from the Set Transformers paper☆67Jan 10, 2023Updated 3 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- ☆15May 8, 2021Updated 4 years ago
- TPSE-GST Tacotron2☆14May 1, 2019Updated 6 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 2 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- The official source code of UniAudio☆95Mar 29, 2024Updated last year
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- Neural style transfer☆21Jul 29, 2021Updated 4 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 2 years ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Mar 11, 2024Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- ☆20Aug 19, 2021Updated 4 years ago
- Official Implementation of StyleTTS-VC☆196Jan 14, 2025Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆109May 1, 2022Updated 3 years ago
- ☆80Aug 8, 2025Updated 6 months ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆89Mar 5, 2022Updated 3 years ago
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Nov 16, 2025Updated 3 months ago
- ☆37May 8, 2021Updated 4 years ago
- ☆69Mar 31, 2021Updated 4 years ago
- Collect Voice Conversion researches☆96Updated this week
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 4 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆73Aug 3, 2021Updated 4 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Aug 15, 2022Updated 3 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆40Jul 17, 2021Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- Demo for 2022 ICASSP☆64Jun 14, 2022Updated 3 years ago