fudan-generative-vision / hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
☆9,151Updated this week
Related projects: ⓘ
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆3,909Updated 2 months ago
- Unofficial Implementation of Animate Anyone☆2,900Updated 2 months ago
- A UI-Focused Agent for Windows OS Interaction.☆7,568Updated this week
- The open source platform for AI-native application development.☆6,038Updated last week
- BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI…☆8,587Updated this week
- ☆4,199Updated this week
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,498Updated 2 months ago
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,182Updated 2 months ago
- MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,272Updated last month
- MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.☆6,824Updated last week
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,318Updated 2 months ago
- MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone☆11,907Updated this week
- Create Magic Story!☆5,787Updated last month
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,108Updated last month
- Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".☆865Updated last month
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆3,926Updated 5 months ago
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"☆1,334Updated 2 months ago
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆2,406Updated last month
- Enjoy the magic of Diffusion models!☆6,349Updated this week
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆6,376Updated last month
- open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…☆2,425Updated this week
- Real time interactive streaming digital human☆3,462Updated last week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆1,595Updated last week
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆2,889Updated 2 months ago
- Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆2,382Updated last month
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆3,277Updated last month
- Your image is almost there!☆7,207Updated last month
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …☆1,364Updated last month
- Multi agent system for AI-driven software development. Combine LLM with DevOps tools to convert natural language requirements into workin…☆6,478Updated last month
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,401Updated last week