zyj-2000 / CUMT_2D_PhotoSpeakerLinks
Official Repo for National Industrial Software Congress 2023:"An Implementation of Multimodal Fusion System for Intelligent Digital Human Generation"
☆21Updated last year
Alternatives and similar repositories for CUMT_2D_PhotoSpeaker
Users that are interested in CUMT_2D_PhotoSpeaker are comparing it to the libraries listed below
Sorting:
- Collections of papers, databases, and codes targeted at Digital Human☆39Updated last year
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"☆31Updated this week
- ☆20Updated 11 months ago
- Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)☆15Updated last year
- Official implementation of Faceptor: A Generalist Model for Face Perception.☆46Updated 9 months ago
- [ECCV 2024🔥] The official code for the paper AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors.☆52Updated 10 months ago
- FaceXBench: Evaluating Multimodal LLMs on Face Understanding☆14Updated 4 months ago
- Video Generation Benchmark☆24Updated last month
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆54Updated last month
- Official implementation for "Diffusion Instruction Tuning"☆23Updated this week
- [ICME2025] From 2D Images to 3D Model: Weakly Supervised Multi-View Face Reconstruction with Deep Fusion☆29Updated 2 months ago
- Face Parsing via SegNeXt, trained on CelebAMask-HQ☆14Updated last year
- [ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis☆47Updated 4 months ago
- ☆28Updated 5 months ago
- Official PyTorch implementation for the paper Generalizable Face Landmarking Guided by Conditional Face Warping (CVPR 2024).☆25Updated 6 months ago
- Latent Diffusion Transformer for Talking Video Synthesis☆59Updated 6 months ago
- Unofficial implement of Live3DPortrait☆37Updated 3 months ago
- ☆40Updated 4 months ago
- ARTalk generates realistic 3D head motions (lip sync, blinking, expressions, head poses) from audio in ⚡ real-time ⚡.☆57Updated 2 months ago
- [MM 2023] Toward High Quality Facial Representation Learning☆15Updated last year
- (CVPR 2024) Official code for paper "Towards Language-Driven Video Inpainting via Multimodal Large Language Models"☆95Updated last year
- [TVCG 2024] ReactFace: Online Multiple Appropriate Facial Reaction Generation in Dyadic Interactions☆17Updated 3 months ago
- Official code for "FaceCom: Towards High-fidelity 3D Facial Shape Completion via Optimization and Inpainting Guidance", CVPR 2024.☆18Updated 8 months ago
- [ICML 2024] Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization☆22Updated 5 months ago
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆43Updated last year
- Official implementation for 'Wild2Avatar: Rendering Humans Behind Occlusions'☆43Updated last year
- Open-vocabulary Semantic Segmentation☆34Updated last year
- ☆38Updated 5 months ago
- ☆77Updated 3 months ago
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆33Updated last year