fudan-generative-vision / hallo2
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
☆4,195Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for hallo2
- Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".☆1,026Updated 3 months ago
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,746Updated 4 months ago
- ☆759Updated last month
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆9,496Updated 2 months ago
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆1,102Updated this week
- The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"☆712Updated 3 weeks ago
- Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language m…☆3,896Updated this week
- MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,300Updated 3 months ago
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,565Updated 2 weeks ago
- 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion☆987Updated last month
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆770Updated 2 months ago
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆647Updated 2 months ago
- [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions☆1,265Updated last month
- SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling☆789Updated 3 weeks ago
- Unofficial Implementation of Animate Anyone☆2,940Updated 4 months ago
- ☆2,031Updated last month
- cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注…☆2,146Updated this week
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"☆1,440Updated 4 months ago
- 【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models☆1,828Updated last week
- PantoMatrix: Co-Speech Talking Head and Gestures Generation☆994Updated 4 months ago
- CSGHub is an open-source large model platform just like on-premise version of Hugging Face. You can easily manage models and datasets, de…☆3,018Updated this week
- open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…☆3,092Updated 2 weeks ago
- [ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"☆520Updated 6 months ago
- SDG is a specialized framework designed to generate high-quality structured tabular data.☆3,277Updated this week
- A tutorial based on MetaGPT to quickly help you understand the concept of agent and muti-agent and get started with coding development. 基…☆1,369Updated 6 months ago
- Accelerating the development of large multimodal models (LMMs) with lmms-eval☆2,068Updated this week
- 🔥minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy 矿池抽水 矿池代理 矿池中转 矿池抽水 minerproxy minerproxy minerproxy miner…☆5,166Updated last week
- An MBTI Exploration of Large Language Models☆474Updated 9 months ago
- Create textures for 3d models using stable-diffusion and blender☆833Updated last year
- A high-performance IM server.☆1,222Updated this week