mindspore-lab / minddiffusion
A collection of diffusion models based on MindSpore
☆161Updated last year
Alternatives and similar repositories for minddiffusion:
Users that are interested in minddiffusion are comparing it to the libraries listed below
- A toolbox of vision models and algorithms based on MindSpore☆245Updated last week
- one for all, Optimal generator with No Exception☆411Updated 2 weeks ago
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆994Updated 2 years ago
- My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"☆228Updated 3 weeks ago
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]☆298Updated 11 months ago
- MindFace is an open source toolkit based on MindSpore, containing the most advanced face recognition and detection models, such as ArcFa…☆46Updated 2 months ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆180Updated last year
- ☆161Updated 2 weeks ago
- 生成扩散模型的Keras实现☆284Updated 2 months ago
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆577Updated 6 months ago
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701☆912Updated 6 months ago
- diffusion-based layout-to-image generation model☆298Updated 2 weeks ago
- 🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)"☆239Updated 2 years ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆560Updated last year
- MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer☆223Updated last year
- A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis☆561Updated 2 years ago
- finetune stable diffusion with Dreambooth、LoRA、ControlNet☆56Updated 2 years ago
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆333Updated 7 months ago
- [CVPR 2024] DeepCache: Accelerating Diffusion Models for Free☆886Updated 9 months ago
- Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks☆296Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,411Updated last year
- Implementation of MagViT2 Tokenizer in Pytorch☆600Updated 3 months ago
- [ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation☆434Updated 4 months ago
- ☆25Updated last year
- huggingface mirror download☆574Updated 3 weeks ago
- Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)☆136Updated 2 years ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆410Updated 5 months ago
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆811Updated last year
- [ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxi…☆236Updated 11 months ago
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆159Updated last year