ZZDoog/Speaker2Dubber

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZZDoog/Speaker2Dubber)

ZZDoog / Speaker2Dubber

[ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning"

☆34

Alternatives and similar repositories for Speaker2Dubber

Users that are interested in Speaker2Dubber are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZZDoog / ProDubber
View on GitHub
[CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…
☆23Jun 6, 2025Updated last year
HDUyiming / SOCCER
View on GitHub
We are very happy that our work has been accepted by ACM Multimedia 2024！🥰
☆12Jan 8, 2025Updated last year
GalaxyCong / HPMDubbing
View on GitHub
[CVPR 2023] Official code for paper: Learning to Dub Movies via Hierarchical Prosody Models.
☆112Jun 21, 2024Updated 2 years ago
GalaxyCong / StyleDubber
View on GitHub
[ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"
☆98Nov 14, 2024Updated last year
GalaxyCong / EmoDubber
View on GitHub
[CVPR 2025] Official source codes for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.
☆38Jun 3, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yili-19 / SSGPA
View on GitHub
☆17Jul 14, 2025Updated last year
kevendai / fandp-ijcai2025-issues
View on GitHub
☆17Oct 13, 2025Updated 9 months ago
ZZDoog / Avatar
View on GitHub
Avatar: An easy-to-use digital portrait PPT presentation video generation system based on Gradio
☆20Nov 7, 2023Updated 2 years ago
chenqi008 / V2C
View on GitHub
Pytorch implementation for “V2C: Visual Voice Cloning”
☆35Jan 28, 2023Updated 3 years ago
Junxi-Chen / PE-MIL
View on GitHub
[CVPR 2024] Official code for paper: Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection.
☆27Aug 19, 2024Updated last year
keonlee9420 / Stepwise_Monotonic_Multihead_Attention
View on GitHub
PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer …
☆39May 16, 2021Updated 5 years ago
KTTRCDL / UMETTS
View on GitHub
UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts
☆41Jun 12, 2025Updated last year
GalaxyCong / HPMDubbing_Vocoder
View on GitHub
16k Hz Vocoder (HiFiGAN Codes and Pretrained Models)
☆18Apr 3, 2023Updated 3 years ago
ftshijt / speech_evaluation
View on GitHub
A toolkit dedicate for speech evaluation.
☆23Sep 26, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
thuhcsi / VoxInstruct
View on GitHub
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling
☆100Nov 9, 2024Updated last year
interactiveaudiolab / emphases
View on GitHub
Crowdsourced and Automatic Speech Prominence Estimation
☆27Apr 12, 2024Updated 2 years ago
tuyunbin / Review-of-Change-Captioning
View on GitHub
This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.
☆17Sep 2, 2025Updated 10 months ago
light1726 / SpeechTripleNet
View on GitHub
The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"
☆33Nov 23, 2023Updated 2 years ago
DavidMChan / Anim400K
View on GitHub
Anim-400K: A dataset designed from the ground up for automated dubbing of video
☆118Jun 21, 2024Updated 2 years ago
asuni / PitchSqueezer
View on GitHub
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆38Jan 17, 2024Updated 2 years ago
May2333 / FDCA
View on GitHub
[ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentangl…
☆23Jul 28, 2025Updated last year
RanaCM / DSU-AVO
View on GitHub
Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023
☆12May 13, 2024Updated 2 years ago
Labmem-Zhouyx / CDFSE_FastSpeech2
View on GitHub
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86Dec 20, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
thevoicecompany / gazelle-train
View on GitHub
Joint speech-language model - respond directly to audio!
☆30May 13, 2024Updated 2 years ago
walker-hyf / ECSS
View on GitHub
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)
☆59Jun 20, 2024Updated 2 years ago
pengzhendong / audio-pipeline
View on GitHub
☆23Oct 17, 2024Updated last year
viczxchen / CofiPara
View on GitHub
☆13Oct 9, 2025Updated 9 months ago
zhenye234 / LLaSA_inference
View on GitHub
☆43Feb 8, 2025Updated last year
ictnlp / LSG
View on GitHub
The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”
☆15Jan 3, 2025Updated last year
zhenye234 / CoMoSpeech
View on GitHub
ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
☆214Apr 26, 2024Updated 2 years ago
hbwu-ntu / EmoCtrlTTS-Eval
View on GitHub
☆19Aug 23, 2024Updated last year
kuan2jiu99 / Awesome-Speech-Generation
View on GitHub
Survey on speech generation work.
☆21Nov 26, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
chen-hangyu / Thermal-Gaussian-main
View on GitHub
Received by ICLR2025
☆60Nov 19, 2025Updated 8 months ago
apple-yinhan / Noise-robust-SED
View on GitHub
☆14Jan 2, 2025Updated last year
qulishen / Harmonizing-Light-and-Darkness
View on GitHub
This repository provides the official implementation for the following paper.
☆10Dec 25, 2024Updated last year
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
walker-hyf / NCSSD
View on GitHub
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆61Nov 1, 2024Updated last year
kaistmm / voxsim_trainer
View on GitHub
[INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset
☆24Sep 29, 2025Updated 10 months ago
NKU-HLT / AudioEditor
View on GitHub
☆47Apr 2, 2025Updated last year