Kevin-thu / StoryMemLinks
Official code for StoryMem: Multi-shot Long Video Storytelling with Memory
☆164Updated this week
Alternatives and similar repositories for StoryMem
Users that are interested in StoryMem are comparing it to the libraries listed below
Sorting:
- ☆227Updated 5 months ago
- Code for CineScale, higher-resolution video generation based on Wan☆181Updated 4 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆85Updated last week
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆633Updated last month
- AnyTalker: Scaling Multi-person Talking Video Generation with Interactivity Refinement☆231Updated 3 weeks ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆539Updated last month
- Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation☆224Updated last week
- iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation☆180Updated 3 weeks ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆146Updated 3 months ago
- [ICCV 2025] LayerAnimate: Layer-specific Control for Animation☆193Updated 4 months ago
- The official implementation of ”RepVideo: Rethinking Cross-Layer Representation for Video Generation“☆123Updated 11 months ago
- Dataset and Benchmark code for EgoEdit☆92Updated 2 weeks ago
- Video Content Customization Using First Frame☆150Updated last week
- Krea Realtime 14B. An open-source realtime AI video model.☆431Updated last month
- [Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning☆158Updated 3 months ago
- Official PyTorch implementation of the paper "FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing"☆72Updated 2 weeks ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models☆89Updated 3 months ago
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping☆85Updated 3 weeks ago
- Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models"☆145Updated 2 months ago
- Echo-4o☆462Updated 2 weeks ago
- Unified Video Editing with Temporal Reasoner☆105Updated last week
- We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while a…☆267Updated this week
- Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"☆330Updated last month
- https://little-misfit.github.io/GRAG-Image-Editing/☆116Updated last month
- Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.☆274Updated last month
- ☆316Updated 3 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆181Updated 5 months ago
- [ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation☆200Updated 6 months ago
- 🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward☆176Updated 3 months ago
- Official implementation of ATI: Any Trajectory Instruction for Controllable Video Generation. https://arxiv.org/pdf/2505.22944☆328Updated 4 months ago