thuhcsi / Crystal.TTVS
The Crystal TTVS engine is a real-time audio-visual multilingual speech synthesizer with an expressive 3D avatar.
☆87 Updated 5 years ago
Alternatives and similar repositories for Crystal.TTVS
Users who are interested in Crystal.TTVS are comparing it to the repositories listed below.
- The project page repo for Neural Dubber. ☆30 Updated 2 years ago
- ☆30 Updated 5 years ago
- PyTorch implementation of the paper "Duration Informed Attention Network for Multimodal Synthesis". ☆184 Updated 5 years ago
- CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion ☆41 Updated 5 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an… ☆46 Updated 7 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆84 Updated 2 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion ☆40 Updated 3 years ago
- Learning Lip Sync of Obama from Speech Audio ☆66 Updated 5 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System ☆35 Updated 5 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer ☆37 Updated 3 years ago
- Chinese TTS ☆75 Updated 5 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022; Official code ☆109 Updated 3 years ago
- PyTorch Implementation of Multi-Singer (ACM-MM'21) ☆139 Updated 3 years ago
- ☆45 Updated 6 years ago
- TTS model based on Transformer. ☆58 Updated 6 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University. ☆76 Updated 6 years ago
- C++ implementation of Chinese character to pinyin conversion ☆15 Updated 7 years ago
- Style tokens with Tacotron 2 ☆62 Updated 2 years ago
- VoiceBank-2023 is a speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems. ☆41 Updated last month
- ☆45 Updated 5 years ago
- Mirror of the VoxCeleb dataset, a large-scale speaker identification dataset ☆74 Updated 6 years ago
- Integration of FastSpeech text-to-mel generation and the fast vocoder SqueezeWave ☆20 Updated 2 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code ☆203 Updated 3 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS ☆126 Updated 3 years ago
- TensorFlow implementation of NVIDIA WaveGlow ☆41 Updated 7 years ago
- TensorFlow implementation of WaveGlow ☆37 Updated 5 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data ☆70 Updated 4 years ago
- Spleeter implementation in PyTorch ☆26 Updated 5 years ago
- An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training" ☆125 Updated 5 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020) ☆83 Updated 4 years ago