HVision-NKU / StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
☆5,923 Updated last month
Related projects
Alternatives and complementary repositories for StoryDiffusion
- Your image is almost there! ☆7,318 Updated 3 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆4,630 Updated 4 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors ☆2,570 Updated 2 months ago
- Kolors Team ☆3,824 Updated 2 months ago
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images. ☆2,242 Updated 3 weeks ago
- Enjoy the magic of Diffusion models! ☆6,552 Updated this week
- More relighting! ☆5,326 Updated last week
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation ☆2,249 Updated 3 months ago
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation ☆5,314 Updated 2 months ago
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing> ☆4,312 Updated 4 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,070 Updated 3 months ago
- [WIP] Layer Diffusion for WebUI (via Forge) ☆3,862 Updated 2 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising ☆2,443 Updated 4 months ago
- Various AI scripts. Mostly Stable Diffusion stuff. ☆3,289 Updated last week
- ☆2,365 Updated 5 months ago
- PhotoMaker [CVPR 2024] ☆9,530 Updated last week
- This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it. ☆11,521 Updated this week
- Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on ☆5,565 Updated 5 months ago
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text ☆1,411 Updated 2 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment ☆2,547 Updated last week
- The best OSS video generation models ☆1,804 Updated this week
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling ☆2,209 Updated last week
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,689 Updated 3 weeks ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models ☆2,957 Updated 2 weeks ago
- Official implementation of AnimateDiff. ☆10,539 Updated 3 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 ☆1,656 Updated last month
- Official implementations for paper: Anydoor: zero-shot object-level image customization ☆3,988 Updated 7 months ago
- [ECCV2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild ☆3,884 Updated this week
- Open-Sora: Democratizing Efficient Video Production for All ☆22,154 Updated 3 months ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt. ☆5,231 Updated 4 months ago