0417keito / VALL-E-X-Trainer-by-CustomDataView external linksLinks
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
☆69Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for VALL-E-X-Trainer-by-CustomData
Users that are interested in VALL-E-X-Trainer-by-CustomData are comparing it to the libraries listed below
Sorting:
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆54Oct 31, 2023Updated 2 years ago
- An unofficial PyTorch implementation of VALL-E☆88Aug 3, 2025Updated 6 months ago
- Real-time end-to-end singing voice convertion☆23Nov 3, 2024Updated last year
- ☆28Nov 15, 2023Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago
- Train the next generation of TTS systems.☆171Sep 13, 2024Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- [Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a…☆69Mar 31, 2024Updated last year
- Heteronym to Phoneme Parser☆19Nov 4, 2023Updated 2 years ago
- ☆17Jul 22, 2024Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html☆2,200Sep 10, 2025Updated 5 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆23Jan 12, 2025Updated last year
- ☆49Jul 22, 2024Updated last year
- unofficial vits2-TTS implementation in pytorch☆546Mar 28, 2024Updated last year
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆366Sep 3, 2024Updated last year
- VoiceBox neural network implementation☆110Aug 2, 2024Updated last year
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆40Aug 4, 2023Updated 2 years ago
- The Open Source Code of UniAudio☆603Jul 22, 2024Updated last year
- ☆19Jul 11, 2024Updated last year
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- ☆39Oct 1, 2023Updated 2 years ago
- ☆40Jan 24, 2023Updated 3 years ago
- Unoffical implementation of Megatts2☆288Mar 23, 2024Updated last year
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS☆64Nov 18, 2024Updated last year
- text to speech using autoregressive transformer and VITS☆249Apr 3, 2024Updated last year
- ☆23Oct 17, 2024Updated last year
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆211Apr 26, 2024Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated last year
- A list of datasets made available by members of the Aalto Acoustics Lab☆29Sep 6, 2024Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆27Feb 21, 2025Updated 11 months ago
- VALL-E 2 reproduction☆134Jul 14, 2024Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch☆132Dec 29, 2025Updated last month
- ☆26Jun 5, 2024Updated last year
- ☆10Dec 10, 2021Updated 4 years ago