Official PyTorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
☆202 · Jul 9, 2023 · Updated 2 years ago
Alternatives and similar repositories for ARLDM
Users who are interested in ARLDM are comparing it to the repositories listed below
- ☆335 · Feb 14, 2023 · Updated 3 years ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023 ☆43 · Jun 27, 2023 · Updated 2 years ago
- [CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models ☆263 · Dec 2, 2024 · Updated last year
- Official code repository for the EMNLP 2021 paper ☆26 · Jan 30, 2022 · Updated 4 years ago
- ☆10 · Sep 12, 2024 · Updated last year
- [CVPR 2025] Official PyTorch implementation of StoryGPT-V ☆40 · Jun 14, 2025 · Updated 8 months ago
- Retrieval-based Spatially Adaptive Normalization for Semantic Image Synthesis (CVPR 2022) ☆26 · May 3, 2022 · Updated 3 years ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR 2023) ☆109 · Jan 23, 2024 · Updated 2 years ago
- ☆33 · Jan 30, 2022 · Updated 4 years ago
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation ☆68 · Sep 26, 2024 · Updated last year
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images ☆506 · Oct 7, 2025 · Updated 4 months ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models ☆357 · Jul 4, 2023 · Updated 2 years ago
- Unofficial implementation of Tune-A-Video ☆192 · Jan 12, 2023 · Updated 3 years ago
- Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2… ☆86 · Apr 22, 2023 · Updated 2 years ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023) ☆764 · Jan 26, 2024 · Updated 2 years ago
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation ☆504 · Nov 16, 2024 · Updated last year
- Official PyTorch codes for the paper: "ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation" ☆243 · Mar 20, 2024 · Updated last year
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor ☆521 · Apr 2, 2024 · Updated last year
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models" ☆413 · Mar 25, 2024 · Updated last year
- ☆82 · Jul 31, 2023 · Updated 2 years ago
- [SIGGRAPH Asia 2023] An interactive story visualization tool that supports multiple characters ☆270 · Mar 22, 2024 · Updated last year
- [ICLR 2024] Code for FreeNoise based on VideoCrafter ☆427 · Aug 25, 2025 · Updated 6 months ago
- Official implementation of "Perturbed-Attention Guidance" ☆60 · Jul 2, 2024 · Updated last year
- Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning ☆316 · Jul 11, 2024 · Updated last year
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort ☆149 · Nov 23, 2024 · Updated last year
- Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023) ☆1,971 · Dec 1, 2025 · Updated 3 months ago
- Retrieval-Augmented Video Generation for Telling a Story ☆259 · Feb 5, 2024 · Updated 2 years ago
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral) ☆543 · Jan 8, 2024 · Updated 2 years ago
- Official PyTorch implementation of GGDR (ECCV 2022) ☆102 · Aug 10, 2022 · Updated 3 years ago
- AnimateDiff I2V version. ☆185 · Mar 1, 2024 · Updated 2 years ago
- A simple and flexible PyTorch implementation of StableDiffusion-XL based on diffusers. ☆19 · Sep 2, 2024 · Updated last year
- ☆56 · Apr 30, 2024 · Updated last year
- [ECCV 2022] Official PyTorch implementation of "Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Imag… ☆126 · Feb 4, 2023 · Updated 3 years ago
- Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023 ☆91 · Feb 14, 2025 · Updated last year
- [ICCV 2023] Official PyTorch implementation for the paper "FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model" ☆307 · Oct 12, 2023 · Updated 2 years ago
- ☆24 · Dec 13, 2025 · Updated 2 months ago
- An in-context conditioning version of MUSE with pre-trained checkpoints. ☆116 · Jun 4, 2023 · Updated 2 years ago
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching ☆168 · Nov 18, 2024 · Updated last year
- Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models" ☆175 · Oct 8, 2023 · Updated 2 years ago