haoningwu3639 / SimpleSDM-3
A simple and flexible PyTorch implementation of StableDiffusion-3 based on diffusers for DIY and finetuning.
☆18 Updated 3 months ago
Alternatives and similar repositories for SimpleSDM-3:
Users who are interested in SimpleSDM-3 are comparing it to the libraries listed below.
- A simple and flexible PyTorch implementation of Video StableDiffusion (ZeroScope_v2) based on diffusers. ☆17 Updated last year
- A simple and flexible PyTorch implementation of StableDiffusion based on diffusers. ☆23 Updated 7 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆98 Updated 3 weeks ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation ☆80 Updated 2 weeks ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions" ☆54 Updated 6 months ago
- ☆21 Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆115 Updated 4 months ago
- PyTorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation" ☆240 Updated this week
- FQGAN: Factorized Visual Tokenization and Generation ☆48 Updated 3 weeks ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models ☆70 Updated 11 months ago
- Unified Multi-modal IAA Baseline and Benchmark ☆75 Updated 6 months ago
- ICCV2023-Diffusion-Papers ☆109 Updated last year
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆99 Updated 6 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆88 Updated 2 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation ☆119 Updated 3 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆83 Updated 9 months ago
- This is the official implementation for ControlVAR. ☆102 Updated 4 months ago
- a collection of awesome autoregressive visual generation models ☆72 Updated last week
- Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention ☆34 Updated last week
- [CVPR 2025 (Oral)] Open implementation of "RandAR" ☆118 Updated last month
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT ☆73 Updated last week
- Code and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models" [NeurIPS 2024] ☆18 Updated 7 months ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation ☆38 Updated last year
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024) ☆49 Updated 4 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23. ☆75 Updated last year
- Fine-tune VAE of Stable Diffusion model ☆34 Updated 7 months ago
- ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning ☆28 Updated 2 weeks ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition ☆149 Updated 2 months ago
- ☆21 Updated 2 weeks ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025) ☆46 Updated 3 months ago