zyj-2000 / CUMT_2D_PhotoSpeaker
Official Repo for National Industrial Software Congress 2023:"An Implementation of Multimodal Fusion System for Intelligent Digital Human Generation"
☆21Updated last year
Alternatives and similar repositories for CUMT_2D_PhotoSpeaker:
Users that are interested in CUMT_2D_PhotoSpeaker are comparing it to the libraries listed below
- Collections of papers, databases, and codes targeted at Digital Human☆37Updated last year
- ☆20Updated 9 months ago
- Official implementation for "Diffusion Instruction Tuning"☆22Updated 2 months ago
- ARTalk generates realistic 3D head motions (lip sync, blinking, expressions, head poses) from audio in ⚡ real-time ⚡.☆36Updated last month
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"☆31Updated 2 weeks ago
- Official implementation of Faceptor: A Generalist Model for Face Perception.☆43Updated 8 months ago
- ☆63Updated last month
- Face Parsing via SegNeXt, trained on CelebAMask-HQ☆14Updated last year
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Updated last year
- [AAAI 25] SegFace: Face Segmentation of Long-tail classes☆54Updated 3 months ago
- Latent Diffusion Transformer for Talking Video Synthesis☆58Updated 5 months ago
- This project is dedicated to advancing the field of animatronic robots by enabling them to generate lifelike facial expressions, pushing …☆3Updated last year
- [ICME2025] From 2D Images to 3D Model: Weakly Supervised Multi-View Face Reconstruction with Deep Fusion☆27Updated last month
- ☆62Updated last year
- ☆17Updated 11 months ago
- ☆35Updated last year
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆53Updated this week
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated 7 months ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆16Updated 2 months ago
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆38Updated last year
- Official implementation for 'Wild2Avatar: Rendering Humans Behind Occlusions'☆43Updated 10 months ago
- [ArXiv 2025] DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting☆23Updated last week
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆41Updated last year
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation☆50Updated this week
- [ICASSP'25] DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face Synthesis☆43Updated 3 months ago
- [MM 2023] Toward High Quality Facial Representation Learning☆15Updated last year
- Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)☆15Updated last year
- Official code for "FaceCom: Towards High-fidelity 3D Facial Shape Completion via Optimization and Inpainting Guidance", CVPR 2024.☆16Updated 7 months ago
- ☆67Updated 10 months ago
- ☆103Updated 9 months ago