mindspore-lab / minddiffusion
A collection of diffusion models based on MindSpore
☆160 Updated last year
Alternatives and similar repositories for minddiffusion:
Users who are interested in minddiffusion are comparing it to the repositories listed below.
- A toolbox of vision models and algorithms based on MindSpore ☆241 Updated 2 months ago
- My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution" ☆213 Updated 2 weeks ago
- one for all, Optimal generator with No Exception ☆381 Updated this week
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models". ☆965 Updated last year
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning ☆326 Updated 4 months ago
- MindFace is an open source toolkit based on MindSpore, containing the most advanced face recognition and detection models, such as ArcFa… ☆46 Updated this week
- ☆152 Updated this week
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers" ☆759 Updated 11 months ago
- Keras implementation of generative diffusion models ☆256 Updated this week
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701 ☆901 Updated 4 months ago
- The pure and clear PyTorch Distributed Training Framework. ☆275 Updated last year
- TaiSu (太素): a large-scale Chinese multimodal dataset (a hundred-million-scale Chinese vision-language pre-training dataset) ☆177 Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,396 Updated last year
- Simple tutorials on Pytorch DDP training ☆273 Updated 2 years ago
- Implementation of MagViT2 Tokenizer in Pytorch ☆588 Updated last month
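- PyTorch model training code for single precision, half precision, mixed precision, single GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, with a comparison of training speed and GPU memory usage across the methods ☆87 Updated 11 months ago
- PASSL includes image self-supervised learning algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, simsiam, SwAV, BEiT, and MAE, as well as Vision Transformer, DEiT, Swin Transformer, CvT, T2T-ViT, MLP-… ☆279 Updated last year
- Fine-tune Stable Diffusion with Dreambooth, LoRA, and ControlNet ☆54 Updated last year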
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants. ☆1,053 Updated last month
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆821 Updated this week
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight] ☆284 Updated 9 months ago
- A collection of awesome text-to-image generation studies. ☆515 Updated this week
- PaddlePaddle Code Convert Toolkit. A deep learning code conversion tool for PaddlePaddle (飞桨) ☆94 Updated last month
- [ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxi… ☆227 Updated 9 months ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023) ☆547 Updated 9 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers ☆571 Updated 3 months ago
- MiniSora: A community aims to explore the implementation path and future development direction of Sora. ☆1,255 Updated last month
- Diffusion Model-Based Image Editing: A Survey (arXiv) ☆555 Updated this week
- ☆103 Updated 10 months ago
- VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks ☆381 Updated 7 months ago