haoningwu3639 / SimpleSDM-3
A simple and flexible PyTorch implementation of StableDiffusion-3 based on diffusers for DIY and finetuning.
β16Updated this week
Alternatives and similar repositories for SimpleSDM-3:
Users that are interested in SimpleSDM-3 are comparing it to the libraries listed below
- A simple and flexible PyTorch implementation of StableDiffusion based on diffusers.β22Updated 3 months ago
- A simple and flexible PyTorch implementation of Video StableDiffusion (ZeroScope_v2) based on diffusers.β16Updated 11 months ago
- XQ-GANπ: An Open-source Image Tokenization Framework for Autoregressive Generationβ179Updated last month
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficientβ75Updated last month
- A simple and flexible PyTorch implementation of StableDiffusion-XL based on diffusers.β14Updated 4 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generationβ94Updated last month
- π₯stable, simple, state-of-the-art VQVAE toolkit & cookbookβ73Updated 6 months ago
- Unified Multi-modal IAA Baseline and Benchmarkβ72Updated 3 months ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generationβ59Updated this week
- This is the official implementation for ControlVAR.β91Updated last month
- [CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Modelsβ227Updated last month
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.β52Updated last week
- ICCV2023-Diffusion-Papersβ109Updated last year
- The collection of awesome papers on alignment of diffusion models.β74Updated this week
- β124Updated 3 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesisβ83Updated 6 months ago
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"β43Updated this week
- This is a repo to track the latest autoregressive visual generation papers.β105Updated 2 weeks ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"β23Updated 2 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.β73Updated last year
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Attenβ¦β34Updated last month
- Training-Free Condition-Guided Text-to-Video Generationβ59Updated last year
- a collection of awesome autoregressive visual generation modelsβ63Updated 3 weeks ago
- Empowering Unified MLLM with Multi-granular Visual Generationβ114Updated this week
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generationβ38Updated last year
- β36Updated 2 weeks ago
- ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"β35Updated last month
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)β43Updated last month
- β19Updated 11 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generationβ66Updated 6 months ago