XinyuZhou2000 / Spoken-DialogueView external linksLinks
☆18Dec 7, 2023Updated 2 years ago
Alternatives and similar repositories for Spoken-Dialogue
Users that are interested in Spoken-Dialogue are comparing it to the libraries listed below
Sorting:
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 2 years ago
- 单独维护的中文TTS☆34Oct 28, 2022Updated 3 years ago
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- ☆26Jun 5, 2024Updated last year
- BEGANSing - Korean SVS + SVC + AudioSR☆11Feb 17, 2024Updated last year
- ☆11Sep 26, 2024Updated last year
- VIPNet: Visual Interaction Perceptual Network for Blind Image Quality Assessment☆11Dec 18, 2024Updated last year
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Jul 31, 2024Updated last year
- visual-text to speech☆14Apr 3, 2022Updated 3 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- official code for "EgoVSR: Towards High-Quality Egocentric Video Super-Resolution"☆15Jul 26, 2023Updated 2 years ago
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆34Nov 23, 2023Updated 2 years ago
- ☆31Jul 13, 2023Updated 2 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 5 months ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- ☆197May 3, 2024Updated last year
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Jun 6, 2022Updated 3 years ago
- MFA acoustic model training based on Opencpop☆15Sep 23, 2022Updated 3 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- vq-wav2vec inference☆13Dec 13, 2021Updated 4 years ago
- Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textg…☆14Feb 9, 2024Updated 2 years ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- ☆39Oct 1, 2023Updated 2 years ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆62Sep 5, 2025Updated 5 months ago
- TransferTTS (Zero-Shot learning of VITS)☆100Sep 23, 2022Updated 3 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- multilingual speech aligner☆76Nov 19, 2023Updated 2 years ago
- ☆18Jul 31, 2019Updated 6 years ago
- ☆77Apr 26, 2022Updated 3 years ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 10 months ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆17May 24, 2020Updated 5 years ago
- ☆47Aug 31, 2024Updated last year
- English conversation corpus for conversational TTS.☆21Mar 13, 2023Updated 2 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- ☆19Apr 28, 2023Updated 2 years ago
- Unsupervised Rhythm Modeling for Voice Conversion☆86Aug 3, 2023Updated 2 years ago