yangdongchao / text-to-sound-synthesis-demo
This is a demo webpage for our paper 'text-to-sound synthesis'
☆125Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for text-to-sound-synthesis-demo
- The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"☆349Updated last year
- An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)☆111Updated 3 months ago
- ☆70Updated 2 years ago
- AI 音乐 - compound-word-transformer,用 Tensorflow 实现☆140Updated last year
- ☆193Updated last year
- Singing Voice Synthesis based on VITS, different from VISinger☆187Updated last year
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion☆227Updated last year
- Anim-400K: A dataset designed from the ground up for automated dubbing of video☆99Updated 5 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆70Updated 7 months ago
- Monotonic Alignment Search☆86Updated 2 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆32Updated last year
- TransferTTS (Zero-Shot learning of VITS)☆90Updated 2 years ago
- a lightweight voice conversion☆78Updated 2 months ago
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆211Updated last year
- Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher☆177Updated last year
- Forced Alignment-MFA☆33Updated 2 years ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆91Updated last month
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆206Updated last year
- An 16kHz implementation of HiFi-GAN for soft-vc.☆93Updated last year
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆238Updated last week
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆170Updated 3 months ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech☆331Updated 2 years ago
- Parallel TTS web demo based on Flask + Vue (Vuetify). 基于 Flask + Vue 的语音合成单网页演示项目。☆45Updated 3 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆46Updated last year
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆134Updated last year
- PyTorch Implementation of Multi-Singer (ACM-MM'21)☆139Updated 2 years ago
- The deme page of InstructTTS☆155Updated 9 months ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆66Updated 2 years ago
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆127Updated 5 months ago
- Finetuning VITS Efficiently☆32Updated last year
- GPT-style network for phonemization with durations of text☆62Updated 8 months ago