mindspore-lab / minddiffusionLinks
A collection of diffusion models based on MindSpore
☆163Updated last year
Alternatives and similar repositories for minddiffusion
Users that are interested in minddiffusion are comparing it to the libraries listed below
Sorting:
- one for all, Optimal generator with No Exception☆442Updated last week
- A toolbox of vision models and algorithms based on MindSpore☆257Updated 3 months ago
- 生成扩散模型的Keras实现☆300Updated 5 months ago
- My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"☆246Updated 3 months ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆189Updated last year
- ☆112Updated 2 years ago
- The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision M…☆497Updated last year
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆337Updated 9 months ago
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆1,030Updated 2 years ago
- ☆25Updated last year
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]☆306Updated last year
- finetune stable diffusion with Dreambooth、LoRA、ControlNet☆57Updated 2 years ago
- Text-To-Image Generation with Chinese Characters☆130Updated 2 years ago
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆169Updated last year
- ☆177Updated last year
- VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks☆386Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,428Updated 2 years ago
- ☆498Updated 2 years ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆112Updated last year
- A collection of awesome text-to-image generation studies.☆636Updated 2 months ago
- ☆69Updated 2 years ago
- 多模态 MM +Chat 合集☆271Updated 2 months ago
- [CVPR2023] A faster, smaller, and better text-to-image model for large-scale training☆242Updated last year
- Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks☆298Updated last year
- Research Code for Multimodal-Cognition Team in Ant Group☆158Updated 2 weeks ago
- 🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)"☆245Updated 3 years ago
- Materials for the Hugging Face Diffusion Models Course☆230Updated 2 years ago
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆583Updated 9 months ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆541Updated last year
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701☆917Updated 9 months ago