zyj-2000 / CUMT_2D_PhotoSpeakerLinks
Official Repo for National Industrial Software Congress 2023:"An Implementation of Multimodal Fusion System for Intelligent Digital Human Generation"
☆20Updated 2 years ago
Alternatives and similar repositories for CUMT_2D_PhotoSpeaker
Users that are interested in CUMT_2D_PhotoSpeaker are comparing it to the libraries listed below
Sorting:
- ☆95Updated 10 months ago
- [CVPR 2024 Highlight] Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis☆237Updated last year
- [CVPR2024] MotionEditor is the first diffusion-based model capable of video motion editing.☆186Updated 4 months ago
- GRPose: Learning Graph Relations for Human Image Generation with Pose Priors (AAAI 2025)☆18Updated 9 months ago
- ☆53Updated last year
- (CVPR 2024) Official code for paper "Towards Language-Driven Video Inpainting via Multimodal Large Language Models"☆99Updated last year
- Unofficial implement of Live3DPortrait☆40Updated 6 months ago
- Unofficial implementation of Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation☆40Updated 2 years ago
- Video Generation Benchmark☆68Updated 7 months ago
- GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting☆89Updated last year
- This is the official implementation of 2024 CVPR paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models".☆92Updated 2 months ago
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing☆116Updated 9 months ago
- [ICML 2025 Spotlight] MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding☆66Updated 6 months ago
- A Collection of AIGC Research Groups☆89Updated 2 months ago
- [ICML 2024] Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization☆23Updated last year
- Official implementation of Faceptor: A Generalist Model for Face Perception.☆48Updated last year
- ☆144Updated last year
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆62Updated 9 months ago
- Official release of FacialFlowNet: Advancing Facial Optical Flow Estimation with a Diverse Dataset and a Decomposed Model (ACMMM2024)☆25Updated last year
- This repo contains the code for PreciseControl project [ECCV'24]☆69Updated last year
- Official implementation of SGDiff (ACM MM '23)☆37Updated 2 years ago
- One-Shot Learning for Pose-Guided Person Image Synthesis in the Wild☆21Updated 9 months ago
- Muti-human Interactive Talking Dataset☆67Updated 5 months ago
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidance☆61Updated 3 months ago
- [SIGGRAPH Asia 2025] Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization☆32Updated 2 months ago
- Towards Localized Fine-Grained Control for Facial Expression Generation☆84Updated last year
- (AAAI2024) Controllable 3D Face Generation with Conditional Style Code Diffusion☆38Updated last year
- [CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models☆262Updated last year
- [CVPR2024] CapHuman: Capture Your Moments in Parallel Universes☆99Updated last year
- Official PyTorch implementation for the paper Generalizable Face Landmarking Guided by Conditional Face Warping (CVPR 2024).☆31Updated last year