shivammehta25 / Diff-TTSGLinks
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
☆39Updated last year
Alternatives and similar repositories for Diff-TTSG
Users that are interested in Diff-TTSG are comparing it to the libraries listed below
Sorting:
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆109Updated 3 years ago
- The project page repo for Neural Dubber.☆30Updated last year
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023☆86Updated last year
- ☆38Updated 3 months ago
- ESLTTS dataset☆16Updated 6 months ago
- An AR+AR TTS attempt.☆16Updated 6 months ago
- ☆57Updated 2 years ago
- PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)☆70Updated last year
- ☆42Updated 6 months ago
- ☆36Updated 4 months ago
- ☆36Updated last year
- Anim-400K: A dataset designed from the ground up for automated dubbing of video☆109Updated last year
- Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))☆45Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion☆83Updated 2 years ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆99Updated last year
- ☆25Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆73Updated 9 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆21Updated 6 months ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Updated 7 months ago
- Official release of StyleTalk dataset.☆67Updated last year
- Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".☆63Updated last year
- ☆67Updated 2 years ago
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730☆129Updated last year
- Official Implementation of EnCLAP (ICASSP 2024)☆93Updated last year
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆44Updated 7 months ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆70Updated last year
- ☆8Updated 11 months ago
- a compact audio-to-phoneme aligner for singing voice☆11Updated last year
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆55Updated last year
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆55Updated 9 months ago