Dorniwang / SpeakerVid-5M-CodeLinks
The official SpeakerVid-5M data curation code.
☆57Updated 5 months ago
Alternatives and similar repositories for SpeakerVid-5M-Code
Users that are interested in SpeakerVid-5M-Code are comparing it to the libraries listed below
Sorting:
- Muti-human Interactive Talking Dataset☆61Updated 4 months ago
- ☆27Updated 9 months ago
- ☆62Updated last month
- The official UniVerse-1 code.☆114Updated 2 months ago
- ☆27Updated 6 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆53Updated last year
- ☆24Updated last year
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026☆36Updated last month
- [CVPR 2025] Official code for "Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation"☆64Updated 6 months ago
- Towards Variable and Coordinated Holistic Co-Speech Motion Generation, CVPR 2024☆58Updated last year
- The official code of Human MotionFormer: Transferring Human Motions with Vision Transformers, ICLR2023☆34Updated 2 years ago
- Benchmark dataset and code of MSRVTT-Personalization☆52Updated last month
- This is official inference code of PD-FGC☆98Updated 2 years ago
- MMHead: Towards Fine-grained Multi-modal 3D Facial Animation (ACM MM 2024)☆34Updated 2 months ago
- ☆31Updated last year
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆48Updated 4 months ago
- The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation☆38Updated 7 months ago
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆20Updated 2 months ago
- [AAAI 2024] Continuous Piecewise-Affine Based Motion Model for Image Animation☆35Updated last year
- ☆45Updated 6 months ago
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset☆100Updated last month
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidance☆60Updated 2 months ago
- A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.☆40Updated 7 months ago
- An official pytorch implementation of "MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts"☆34Updated last year
- ☆14Updated 3 years ago
- [🔥ICCV 2025] SemTalk Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis☆37Updated this week
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆36Updated 2 months ago
- Official implentation of SingingHead: A Large-scale 4D Dataset for Singing Head Animation. (TMM 25)☆62Updated 3 weeks ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆62Updated 8 months ago
- ☆128Updated last year