fudan-generative-vision / hallo2
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
☆3,481Updated 3 weeks ago
Alternatives and similar repositories for hallo2:
Users that are interested in hallo2 are comparing it to the libraries listed below
- Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".☆1,047Updated 6 months ago
- Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks☆1,064Updated 3 weeks ago
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,159Updated 7 months ago
- ☆767Updated 2 months ago
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆885Updated 3 weeks ago
- [LCLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation☆914Updated 3 months ago
- The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"☆1,070Updated last month
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,234Updated 5 months ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,045Updated 2 weeks ago
- MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,261Updated last week
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆1,004Updated 3 months ago
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"☆1,486Updated 2 months ago
- Customized ID Consistent for human☆940Updated this week
- Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥☆541Updated last month
- Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language m…☆4,367Updated this week
- Video generation from text&image, 1st-gen☆745Updated last week
- Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the maki…☆875Updated last week
- PantoMatrix: Generating Face and Body Animation from Speech☆967Updated last month
- 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion☆882Updated 4 months ago
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code☆565Updated 4 months ago
- Unofficial Implementation of Animate Anyone☆2,907Updated 7 months ago
- Build multimodal language agents for fast prototype and production☆1,777Updated this week
- [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions☆1,039Updated 4 months ago
- Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code☆997Updated 4 months ago
- The official HelloMeme GitHub site☆566Updated last week
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆689Updated 3 weeks ago
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,626Updated last month
- Align Anything: Training All-modality Model with Feedback☆2,154Updated this week
- Memory-Guided Diffusion for Expressive Talking Video Generation☆718Updated 3 weeks ago
- Diffusion-based Portrait and Animal Animation☆668Updated last month