TouchSky-Lab / Awesome-Text-to-Speech-TTS
Awesome TTS
☆54 Updated 3 years ago
Related projects
Alternatives and complementary repositories for Awesome-Text-to-Speech-TTS
- Official Code for ParrotTTS ☆43 Updated last month
- flow mirror models from JZX AI Labs ☆40 Updated last month
- Official release of StyleTalk dataset. ☆57 Updated 4 months ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models ☆39 Updated 5 months ago
- "SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts" ☆74 Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆84 Updated last month
- Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System. ☆113 Updated 2 weeks ago
- ☆31 Updated 3 weeks ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆71 Updated 7 months ago
- FlashSpeech: Efficient Zero-Shot Speech Synthesis ☆97 Updated 2 months ago
- Huawei Grad-TTS for Chinese ☆45 Updated last year
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" (ICLR 2024) ☆139 Updated last year
- Fine-Tune Whisper with Transformers and PEFT ☆38 Updated last year
- ☆65 Updated last year
- Monotonic Alignment Search ☆86 Updated 2 years ago
- Train the next generation of TTS systems. ☆161 Updated 2 months ago
- Implementation of Google's USM speech model in Pytorch ☆25 Updated 2 weeks ago
- AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension ☆56 Updated 3 months ago
- An unofficial PyTorch implementation of VALL-E ☆77 Updated this week
- The official implementation of EmoSphere++ ☆41 Updated 2 weeks ago
- All generative models in one for a better TTS model ☆66 Updated 2 months ago
- GPT-style network for phonemization with durations of text ☆62 Updated 8 months ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems. ☆37 Updated last year
- Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech. ☆201 Updated 10 months ago
- A Survey of Spoken Dialogue Models (60 pages) ☆97 Updated this week
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac… ☆61 Updated 5 months ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E ☆135 Updated last month
- A text-conditional diffusion probabilistic model capable of generating high-fidelity audio. ☆127 Updated 5 months ago
- Putting flows on top of neural transducers for better TTS ☆63 Updated 3 weeks ago
- Diffusion Singing Voice Conversion based on Grad-TTS from Huawei ☆134 Updated last year