Kevin-thu / StoryMemLinks
Official code for StoryMem: Multi-shot Long Video Storytelling with Memory
☆620Updated 3 weeks ago
Alternatives and similar repositories for StoryMem
Users that are interested in StoryMem are comparing it to the libraries listed below
Sorting:
- We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while a…☆417Updated last week
- ☆706Updated 2 months ago
- Streamlining Cartoon Production with Generative Post-Keyframing☆529Updated 5 months ago
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆1,080Updated 3 weeks ago
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.☆713Updated last month
- Official Code Repo for UniVA: Universal Video Agents☆299Updated last month
- One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer☆429Updated last month
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆493Updated 5 months ago
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt☆310Updated 3 months ago
- [SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off☆328Updated 3 months ago
- MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.☆500Updated 3 weeks ago
- Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives☆591Updated last month
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆556Updated 2 months ago
- ☆368Updated 10 months ago
- In-context subject-driven image generation while preserving foreground fidelity☆350Updated 7 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement☆282Updated last week
- Lynx: Towards High-Fidelity Personalized Video Generation☆305Updated 3 months ago
- Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering, designed to operate efficiently under stri…☆298Updated last month
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation☆260Updated 11 months ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆278Updated last month
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."☆425Updated 7 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆671Updated 4 months ago
- SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation☆562Updated 3 weeks ago
- [NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulatio…☆617Updated 2 months ago
- [ICCV 2025] Code Implementation of "ArtEditor: Learning Customized Instructional Image Editor from Few-Shot Examples"☆430Updated 8 months ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆222Updated 4 months ago
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆656Updated 2 months ago
- Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"☆1,407Updated 3 weeks ago
- [CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Gener…☆299Updated 9 months ago
- Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.☆863Updated 4 months ago