An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
☆7,957Feb 11, 2024Updated 2 years ago
Alternatives and similar repositories for VALL-E-X
Users that are interested in VALL-E-X are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html☆2,211Sep 10, 2025Updated 6 months ago
- 🔊 Text-Prompted Generative Audio Model☆39,066Aug 19, 2024Updated last year
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆44,896Aug 16, 2024Updated last year
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,777Mar 3, 2026Updated 3 weeks ago
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine☆8,463Aug 13, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An unofficial PyTorch implementation of the audio LM VALL-E☆2,992May 10, 2023Updated 2 years ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆6,227Aug 10, 2024Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆23,112Mar 3, 2026Updated 3 weeks ago
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆36,168Apr 19, 2025Updated 11 months ago
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,721May 27, 2025Updated 10 months ago
- Text-to-Audio/Music Generation☆2,607Sep 29, 2024Updated last year
- vits2 backbone with multilingual-bert