Dorniwang / SpeakerVid-5M-CodeLinks
The official SpeakerVid-5M data curation code.
☆56Updated 4 months ago
Alternatives and similar repositories for SpeakerVid-5M-Code
Users that are interested in SpeakerVid-5M-Code are comparing it to the libraries listed below
Sorting:
- Muti-human Interactive Talking Dataset☆57Updated 4 months ago
- ☆62Updated last week
- ☆27Updated 9 months ago
- The official UniVerse-1 code.☆108Updated last month
- ☆24Updated 11 months ago
- Towards Variable and Coordinated Holistic Co-Speech Motion Generation, CVPR 2024☆58Updated last year
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆53Updated 11 months ago
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆35Updated last month
- MMHead: Towards Fine-grained Multi-modal 3D Facial Animation (ACM MM 2024)☆33Updated last month
- The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation☆38Updated 7 months ago
- ☆26Updated 6 months ago
- [AAAI 2024] SAAS - Official PyTorch Implementation☆10Updated last year
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆19Updated last month
- ☆45Updated 5 months ago
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026☆35Updated 2 weeks ago
- The official code of Human MotionFormer: Transferring Human Motions with Vision Transformers, ICLR2023☆34Updated 2 years ago
- Benchmark dataset and code of MSRVTT-Personalization☆51Updated last month
- UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons (ACM MM 2023 Oral)☆54Updated last year
- [ICCV2025] SemTalk Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis☆36Updated this week
- This is official inference code of PD-FGC☆97Updated 2 years ago
- A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.☆40Updated 6 months ago
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆22Updated 8 months ago
- ☆128Updated last year
- [CVPR 2025] Official code for "Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation"☆64Updated 6 months ago
- ☆14Updated 2 years ago
- [arXiv'24] Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space☆47Updated last year
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556)☆56Updated 9 months ago
- ☆20Updated last year
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset☆93Updated 3 weeks ago
- Data and Pytorch implementation of IEEE TMM "EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation"☆30Updated last year