ex3ndr / supervoice-voicebox
VoiceBox neural network implementation
☆96Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for supervoice-voicebox
- VALL-E 2 reproduction☆87Updated 4 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆83Updated last month
- Official implementation of Vec-Tok Speech☆93Updated last year
- Official Implementation of StyleTTS-VC☆164Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆126Updated 8 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆93Updated 2 weeks ago
- The official Implementation of PeriodWave and PeriodWave-Turbo☆132Updated 3 months ago
- ☆81Updated 2 months ago
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.☆149Updated 2 months ago
- Implementation of SoundStorm built upon SpeechTokenizer.☆104Updated last year
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆97Updated 2 weeks ago
- Train the next generation of TTS systems.☆161Updated 2 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆64Updated last week
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆43Updated last month
- All generative model in one for better TTS model☆66Updated 2 months ago
- ☆32Updated 2 months ago
- ☆70Updated last year
- ☆57Updated 2 months ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆112Updated 2 years ago
- Finetuning VITS Efficiently☆32Updated last year
- ☆33Updated last year
- The open source code for SimpleSpeech series☆111Updated last month
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆135Updated 6 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆70Updated 7 months ago
- A TTS model that makes a speaker speak new languages☆75Updated 5 months ago
- GPT-style network for phonemization with durations of text☆62Updated 8 months ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆76Updated last year
- An unofficial PyTorch implementation of VALL-E☆77Updated this week
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆114Updated 5 months ago
- Implementation of TTS model based on NVIDIA P-Flow TTS Paper☆67Updated 6 months ago