[ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation
☆24Mar 25, 2025Updated 11 months ago
Alternatives and similar repositories for DiffusionTalker
Users that are interested in DiffusionTalker are comparing it to the libraries listed below
Sorting:
- KMM: Key Frame Mask Mamba for Extended Motion Generation☆19Sep 22, 2025Updated 5 months ago
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆14Jul 31, 2025Updated 7 months ago
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation☆23Dec 30, 2025Updated 2 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 7 months ago
- ☆13Jul 10, 2024Updated last year
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆69Apr 8, 2025Updated 10 months ago
- Controllable Group Choreography using Contrastive Diffusion☆18Nov 25, 2025Updated 3 months ago
- Code for CVPR 2024 paper: ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis☆35Apr 29, 2025Updated 10 months ago
- [ICCV2025 Highlight] GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting☆25Sep 10, 2025Updated 5 months ago
- ☆19Jul 8, 2024Updated last year
- official implementation of "CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusi…☆18Sep 5, 2024Updated last year
- codes for Makeup Extraction of 3D Representation via Illumination-Aware Image Decomposition (Eurographics2023)☆18Mar 2, 2025Updated last year
- DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer☆166Mar 31, 2024Updated last year
- Just a copy of https://github.com/RobynE23/CodeHS-Java-APCSA, but I added folders and some extra files that didn't exist. Another option …☆27Jan 23, 2024Updated 2 years ago
- ☆62Jul 1, 2025Updated 8 months ago
- ☆17Jul 16, 2025Updated 7 months ago
- [CVPR 2025] Official Implementation of "MixerMDM: Learnable Composition of Human Motion Diffusion Models".☆24Sep 8, 2025Updated 5 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated 11 months ago
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…☆28Jan 21, 2025Updated last year
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆45Jan 25, 2026Updated last month
- [NeurIPS24] Optimal-State Dynamics Estimation for Physics-based Human Motion Capture from Videos☆21Jan 27, 2026Updated last month
- [CVPR 2026] Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation☆55Dec 16, 2025Updated 2 months ago
- The official PyTorch implementation of "The 18th European Conference on Computer Vision" (ECCV 2024) paper Length-Aware Motion Synthesis …☆20Dec 15, 2024Updated last year
- A framework for text-based retrieval augmented motion generation☆24Feb 18, 2025Updated last year
- Code for the paper "Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters"☆24Jan 7, 2025Updated last year
- [T-PAMI2025] Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy☆28Jan 13, 2025Updated last year
- ☆31Jul 1, 2025Updated 8 months ago
- Official code release for "Learned Spatial Representations for Few-shot Talking-Head Synthesis" ICCV 2021☆24Oct 20, 2021Updated 4 years ago
- Free-T2M: Frequency enhanced text-to-motion diffusion model with consistency loss☆70Feb 9, 2025Updated last year
- AnyTalker: Scaling Multi-person Talking Video Generation with Interactivity Refinement☆278Dec 5, 2025Updated 2 months ago
- [TCSVT 2024] Official PyTorch implementation of the paper "MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Hum…☆27Jul 22, 2024Updated last year
- The official pytorch code for TalkingStyle: Personalized Speech-Driven Facial Animation with Style Preservation☆31Jul 3, 2024Updated last year
- Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation☆238Nov 12, 2025Updated 3 months ago
- Official code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation" [AAAI2025]☆62Feb 13, 2025Updated last year
- ☆34Dec 16, 2025Updated 2 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- Wan 2.5 AI Video Generator - Transform text & images into HD videos with synchronized audio☆79Sep 25, 2025Updated 5 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- Code for ReMoS: 3D-Motion Conditioned Reaction Synthesis for Two-person Interactions (ECCV 2024)☆34Mar 4, 2025Updated 11 months ago