foocker / AIGC
扩散模型算法基础文档、训练、实验、部署等仓库
☆26Updated 3 months ago
Related projects: ⓘ
- https://www.shoufachen.com/Awesome-Diffusion-Transformers/☆106Updated 6 months ago
- ☆23Updated last year
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆70Updated this week
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆36Updated 5 months ago
- finetune stable diffusion with Dreambooth、LoRA、ControlNet☆51Updated last year
- Scaling RWKV-Like Architectures for Diffusion Models☆110Updated 5 months ago
- 📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.☆134Updated last week
- Precision Search through Multi-Style Inputs☆45Updated last month
- ☆34Updated 3 months ago
- My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"☆168Updated last week
- [SIGGRAPH Asia 2024] Painting process generating using diffusion models☆46Updated last month
- 这是一个DiT-pytorch的代码,主要用于学习DiT结构。☆59Updated 6 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆56Updated 2 months ago
- ☆48Updated this week
- A list for Text-to-Video, Image-to-Video works☆167Updated last month
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]☆260Updated 4 months ago
- Official code for "DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics" (NeurIPS 2023)☆93Updated 5 months ago
- The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆64Updated 3 months ago
- Official implementation of FouriScale (ECCV2024)☆131Updated last month
- Unified Multi-modal IAA Baseline and Benchmark☆68Updated 5 months ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆107Updated 3 months ago
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆13Updated last week
- Scaling Diffusion Transformers with Mixture of Experts☆178Updated last week
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…☆111Updated 4 months ago
- 从零手搓Flow Matching(Rectified Flow)☆59Updated last week
- The official PyTorch implementation of Fast Diffusion Model☆91Updated last year
- [MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501☆32Updated last month
- ☆39Updated 6 months ago
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆32Updated 6 months ago
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆22Updated last week