shivammehta25/Diff-TTSG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shivammehta25/Diff-TTSG)

shivammehta25 / Diff-TTSG

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis

☆40

Alternatives and similar repositories for Diff-TTSG

Users that are interested in Diff-TTSG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

isjwdu / DFADD
View on GitHub
Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset
☆16Apr 7, 2025Updated last year
hcy71o / SC-CNN
View on GitHub
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39Nov 1, 2023Updated 2 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
AlexandaJerry / SingingVoice-MFA-Training
View on GitHub
MFA acoustic model training based on Opencpop
☆15Sep 23, 2022Updated 3 years ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
declare-lab / HyperTTS
View on GitHub
☆40Apr 15, 2024Updated 2 years ago
zyhbili / LivelySpeaker
View on GitHub
[ICCV-2023] The official repo for the paper "LivelySpeaker: Towards Semantic-aware Co-Speech Gesture Generation".
☆87Jun 3, 2024Updated 2 years ago
cjerry1243 / Tacotron2-SpeechGesture
View on GitHub
This is the official repository for our publication "The IVI Lab entry to the GENEA Challenge 2022 – A Tacotron2 Based Method for Co-Spee…
☆13May 2, 2023Updated 3 years ago
Yoshifumi-Nakano / visual-text-to-speech
View on GitHub
visual-text to speech
☆14Apr 3, 2022Updated 4 years ago
theodorblackbird / lina-speech
View on GitHub
Official implementation of the TTS model Lina-Speech
☆178Jan 9, 2025Updated last year
rgzn-aiyun / melgan-cpu
View on GitHub
Real-time melgan based on cpu ！！！
☆13Dec 3, 2019Updated 6 years ago
Advocate99 / DiffGesture
View on GitHub
[CVPR'2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
☆265Mar 18, 2026Updated 4 months ago
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
View on GitHub
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆80May 29, 2023Updated 3 years ago
zhenye234 / CoMoSpeech
View on GitHub
ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
☆214Apr 26, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
genea-workshop / genea_numerical_evaluations
View on GitHub
Scripts for numerical evaluations for the GENEA Gesture Generation Challenge
☆24Nov 28, 2022Updated 3 years ago
yzhou359 / vid-reenact
View on GitHub
Code for CVPR 2022 paper "Audio-driven Neural Gesture Reenactment with Video Motion Graphs"
☆29Mar 28, 2022Updated 4 years ago
nii-yamagishilab / ZMM-TTS
View on GitHub
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
☆185Mar 6, 2024Updated 2 years ago
sony / bigvsan
View on GitHub
Pytorch implementation of BigVSAN
☆203Dec 9, 2025Updated 7 months ago
yangdongchao / UniAudio
View on GitHub
The Open Source Code of UniAudio
☆605Jul 22, 2024Updated 2 years ago
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
tabahi / contexless-phonemes-CUPE
View on GitHub
pytorch model for contexless-phoneme prediction from speech audio
☆32Oct 30, 2025Updated 8 months ago
p0p4k / pflowtts_pytorch
View on GitHub
Unofficial implementation of NVIDIA P-Flow TTS paper
☆228Dec 24, 2024Updated last year
OlaWod / PitchVC
View on GitHub
PitchVC: Pitch Conditioned Any-to-Many Voice Conversion
☆35Jun 6, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zhenye234 / X-Codec-2.0
View on GitHub
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆361Jun 25, 2026Updated last month
Labmem-Zhouyx / CDFSE_FastSpeech2
View on GitHub
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86Dec 20, 2022Updated 3 years ago
alvinliu0 / HA2G
View on GitHub
[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"
☆144Mar 16, 2023Updated 3 years ago
mcfletch / sphfile
View on GitHub
NIST SPH File reader (e.g. for TEDLIUM Corpus)
☆26May 2, 2020Updated 6 years ago
X-LANCE / VoiceFlow-TTS
View on GitHub
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
☆376Sep 3, 2024Updated last year
sony / diffiner
View on GitHub
☆68Aug 16, 2023Updated 2 years ago
shinhyeokoh / rwen
View on GitHub
☆14Jun 16, 2023Updated 3 years ago
eloimoliner / audio-inpainting-diffusion
View on GitHub
☆74Apr 4, 2024Updated 2 years ago
AI-S2-Lab / FluentEditor
View on GitHub
[InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency
☆62Oct 23, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hcy71o / TransferTTS
View on GitHub
TransferTTS (Zero-Shot learning of VITS)
☆102Sep 23, 2022Updated 3 years ago
ditto-tts / ditto-tts.github.io
View on GitHub
Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer
☆38Feb 17, 2025Updated last year
rgzn-aiyun / tacotron2-melgan
View on GitHub
Mel spectrum based on tacotron2 for melgan speech synthesis
☆15Mar 24, 2023Updated 3 years ago
YoungSeng / ReprGesture
View on GitHub
The ReprGesture entry to the GENEA Challenge 2022 (IMCI 2022)
☆16Nov 8, 2022Updated 3 years ago
xjchenGit / MTDVocaLiST
View on GitHub
Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).
☆29Apr 3, 2024Updated 2 years ago
rishikksh20 / NU-Wave2-pytorch
View on GitHub
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25Jul 5, 2022Updated 4 years ago
t13m / kaldi-readers-for-tensorflow
View on GitHub
readers that enable reading kaldi ark in tensorflow
☆17Mar 7, 2018Updated 8 years ago