Official source code for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.
☆37Jun 3, 2025Updated 9 months ago
Alternatives and similar repositories for EmoDubber
Users that are interested in EmoDubber are comparing it to the libraries listed below
- [ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"☆98Nov 14, 2024Updated last year
- 16k Hz Vocoder (HiFiGAN Codes and Pretrained Models)☆18Apr 3, 2023Updated 2 years ago
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…☆23Jun 6, 2025Updated 8 months ago
- [CVPR 2023] Official code for paper: Learning to Dub Movies via Hierarchical Prosody Models.☆111Jun 21, 2024Updated last year
- Pytorch implementation for “V2C: Visual Voice Cloning”☆33Jan 28, 2023Updated 3 years ago
- [TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…☆27May 8, 2025Updated 9 months ago
- [ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning"☆33May 7, 2025Updated 9 months ago
- [CVPR 2025] Official implementation of paper "Multi-Granularity Class Prototype Topology Distillation for Class-Incremental Source-Free …☆17Aug 26, 2025Updated 6 months ago
- ☆13Jul 17, 2024Updated last year
- Automatically generate a lip-synced avatar based off of a transcript and audio☆15Feb 17, 2023Updated 3 years ago
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"☆23Dec 8, 2024Updated last year
- [ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentangl…☆22Jul 28, 2025Updated 7 months ago
- ☆14Oct 10, 2024Updated last year
- [WACV 2026] LASER: Lip Landmark Assisted Speaker Detection for Robustness official implementation☆22Feb 26, 2026Updated last week
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆18Dec 1, 2024Updated last year
- [TMM 2025] This is the official Pytorch code for our paper "Visual Position Prompt for MLLM based Visual Grounding".☆27Jul 23, 2025Updated 7 months ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Oct 20, 2020Updated 5 years ago
- The project page repo for Neural Dubber.☆30Sep 20, 2023Updated 2 years ago
- Splitting the ASR probability distribution results into Chinese pinyin, so as to extract more effective features for Chinese speech…☆21Mar 16, 2023Updated 2 years ago
- Code base for the "Motion Matching for Responsive Animation For Digital Humans" project.☆21Jun 15, 2023Updated 2 years ago
- ☆25Dec 19, 2024Updated last year
- [ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis☆54Oct 25, 2025Updated 4 months ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Aug 30, 2024Updated last year
- Avatar: An easy-to-use digital portrait PPT presentation video generation system based on Gradio☆20Nov 7, 2023Updated 2 years ago
- [ICCV 2023] This is the Pytorch code for our paper "Self-Supervised Cross-View Representation Reconstruction for Change Captioning".☆20Sep 25, 2025Updated 5 months ago
- ☆27Jun 27, 2023Updated 2 years ago
- Talking head animation☆28Dec 8, 2023Updated 2 years ago
- ☆28Oct 1, 2023Updated 2 years ago
- wav2lip in a Vector Quantized (VQ) space☆27Jun 20, 2023Updated 2 years ago
- ☆32Nov 25, 2023Updated 2 years ago
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆41Jun 12, 2025Updated 8 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- Source code of the paper "An efficient implementation for solving the all pairs minimax path problem in an undirected dense graph."☆16Dec 3, 2025Updated 3 months ago
- [AAAI2025] GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians☆37Apr 2, 2025Updated 11 months ago
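- Ultra-fast, RT-accelerated inference built on wav2lip, using models such as wav2lip, gfpgan, and yolov5. Tested at up to 0.03 seconds per frame on a 2070 GPU, achieving real-time inference.☆31Sep 23, 2025Updated 5 months ago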
- Realtime voice agents for role play and more.☆41Mar 7, 2025Updated 11 months ago
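- GAN Step By Step (GSBS): as the name suggests, I hope to learn GANs step by step. A GAN, or generative adversarial network, is an unsupervised algorithm that has been very popular in recent years; it can generate highly realistic photos, images, and even videos. GANs are an entirely new area of imaging; from their emergence in 2014 to the present, in computer vision…☆11Jan 11, 2023Updated 3 years ago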
- ☆13Dec 16, 2022Updated 3 years ago
- Wind Turbine Blade Image Dataset☆13May 23, 2019Updated 6 years ago