An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
☆70Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for VALL-E-X-Trainer-by-CustomData
Users that are interested in VALL-E-X-Trainer-by-CustomData are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An unofficial PyTorch implementation of VALL-E☆88Aug 3, 2025Updated 8 months ago
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- Real-time end-to-end singing voice convertion☆24Nov 3, 2024Updated last year
- PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html☆2,208Sep 10, 2025Updated 7 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆55Oct 31, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- [Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a…☆68Mar 31, 2024Updated 2 years ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆371Sep 3, 2024Updated last year
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch☆277Oct 30, 2023Updated 2 years ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆107Jan 17, 2025Updated last year
- The Open Source Code of UniAudio☆604Jul 22, 2024Updated last year
- unofficial vits2-TTS implementation in pytorch☆548Mar 28, 2024Updated 2 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Apr 18, 2024Updated last year
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆29Mar 28, 2024Updated 2 years ago
- Unoffical implementation of Megatts2☆286Mar 23, 2024Updated 2 years ago
- Train the next generation of TTS systems.☆170Sep 13, 2024Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- VALL-E 2 reproduction☆135Jul 14, 2024Updated last year
- VoiceBox neural network implementation☆109Aug 2, 2024Updated last year
- ☆28Nov 15, 2023Updated 2 years ago
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆212Apr 26, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A collection of all our phonemeizers for dataset construction and inference☆28Feb 21, 2025Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion☆85Aug 3, 2023Updated 2 years ago
- Codebase and project page for EDMSound☆35Nov 20, 2023Updated 2 years ago
- ☆81Aug 8, 2025Updated 8 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 6 months ago
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch☆680Oct 1, 2024Updated last year
- VoiceLDM: Text-to-Speech with Environmental Context☆192Aug 9, 2024Updated last year
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS☆64Nov 18, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Create Unmute voice embeddings☆25Nov 15, 2025Updated 5 months ago
- Implementation of SoundStorm built upon SpeechTokenizer.☆117Nov 2, 2023Updated 2 years ago
- ☆27Dec 16, 2023Updated 2 years ago
- ☆39Oct 1, 2023Updated 2 years ago
- Official Implementation of StyleTTS-VC☆198Jan 14, 2025Updated last year
- An official implementation of Style-Talker for Spoken Dialogue Generation☆23Jan 12, 2025Updated last year
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago