Accepted as [NeurIPS 2024] Spotlight Presentation Paper
☆6,405Sep 26, 2024Updated last year
Alternatives and similar repositories for StoryDiffusion
Users that are interested in StoryDiffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of AnimateDiff.☆12,107Jul 31, 2024Updated last year
- Your image is almost there!☆7,630Jul 26, 2024Updated last year
- Open-Sora: Democratizing Efficient Video Production for All☆28,904Apr 9, 2026Updated 2 weeks ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,252Feb 16, 2025Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆3,003Sep 8, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆5,023Jul 2, 2024Updated last year
- Enjoy the magic of Diffusion models!☆12,258Apr 16, 2026Updated last week
- [SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation☆5,957Mar 19, 2025Updated last year
- PhotoMaker [CVPR 2024]☆10,110Oct 31, 2024Updated last year
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,946Jul 18, 2024Updated last year
- More relighting!☆8,408Feb 20, 2025Updated last year
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,158Mar 8, 2026Updated last month
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆12,668Nov 4, 2025Updated 5 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,156Jan 10, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,293Nov 27, 2025Updated 4 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,838Jun 28, 2024Updated last year
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,666Mar 5, 2025Updated last year
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆2,007Sep 18, 2024Updated last year
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,367Jan 24, 2025Updated last year
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,631Mar 27, 2025Updated last year
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,536Jul 31, 2025Updated 8 months ago
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆5,053Jan 9, 2026Updated 3 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,264Mar 6, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,533Jun 28, 2024Updated last year
- Character Animation (AnimateAnyone, Face Reenactment)☆3,501May 31, 2024Updated last year
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …☆1,608Aug 15, 2024Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,843Feb 1, 2025Updated last year
- Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation☆14,785Sep 20, 2025Updated 7 months ago
- Bring portraits to life!☆18,158Mar 2, 2026Updated last month
- [CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"☆10,908Aug 29, 2025Updated 7 months ago
- Generative Models by Stability AI☆27,117Dec 16, 2025Updated 4 months ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,228Apr 8, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Kolors Team☆4,611Nov 13, 2024Updated last year
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA☆1,645Sep 25, 2024Updated last year
- Official Code for MotionCtrl [SIGGRAPH 2024]☆1,492Feb 19, 2025Updated last year
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,568Nov 18, 2025Updated 5 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,292Oct 31, 2024Updated last year
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,851Mar 7, 2025Updated last year
- [ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild☆4,975Mar 7, 2025Updated last year