ubc-vision / Make-A-Story
Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023
β39Updated last year
Alternatives and similar repositories for Make-A-Story:
Users that are interested in Make-A-Story are comparing it to the libraries listed below
- T2VScore: Towards A Better Metric for Text-to-Video Generationβ78Updated 10 months ago
- ποΈ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"β104Updated 9 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesisβ83Updated 7 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.β46Updated 4 months ago
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)β77Updated last year
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.β75Updated last year
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models"β41Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Modelsβ45Updated last year
- β24Updated 10 months ago
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generationβ68Updated last year
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillationβ59Updated 4 months ago
- β30Updated last year
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"β46Updated 4 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalizationβ95Updated 10 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingβ90Updated 10 months ago
- Official implementation for "LOVECon: Text-driven Training-free Long Video Editing with ControlNet"β40Updated last year
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"β66Updated 2 months ago
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntaxβ18Updated last year
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"β62Updated 10 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"β44Updated 2 months ago
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)β20Updated 9 months ago
- β77Updated last year
- Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024β37Updated last year
- [TMLR] Official PyTorch implementation of "Ξ»-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latentβ¦β51Updated 3 months ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generationβ38Updated last year
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]β73Updated 2 weeks ago
- [ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"β36Updated last month
- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversionβ35Updated 7 months ago
- [ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completionβ36Updated 7 months ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)β37Updated last month