mindspore-lab / minddiffusionLinks
A collection of diffusion models based on MindSpore
☆161Updated last year
Alternatives and similar repositories for minddiffusion
Users that are interested in minddiffusion are comparing it to the libraries listed below
Sorting:
- My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"☆262Updated this week
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆179Updated 2 years ago
- one for all, Optimal generator with No Exception☆459Updated last week
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆191Updated last year
- A toolbox of vision models and algorithms based on MindSpore☆261Updated 3 months ago
- The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision M…☆501Updated last year
- ☆24Updated last year
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆122Updated last year
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆342Updated last year
- 生成扩散模型的Keras实现☆315Updated 8 months ago
- ☆116Updated 2 years ago
- finetune stable diffusion with Dreambooth、LoRA、ControlNet☆59Updated 2 years ago
- ☆192Updated last year
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆1,054Updated 2 years ago
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]☆310Updated last year
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆594Updated last year
- Research Code for Multimodal-Cognition Team in Ant Group☆169Updated 2 weeks ago
- VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks☆389Updated last year
- DeepSpeed Tutorial☆102Updated last year
- 多模态 MM +Chat 合集☆276Updated 2 months ago
- ☆103Updated last year
- [COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources☆277Updated 2 months ago
- ☆512Updated 2 years ago
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆604Updated last year
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,097Updated 10 months ago
- diffusion-based layout-to-image generation model☆318Updated 6 months ago
- The pure and clear PyTorch Distributed Training Framework.☆274Updated last year
- A collection of awesome text-to-image generation studies.☆686Updated last week
- Materials for the Hugging Face Diffusion Models Course☆237Updated 2 years ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,444Updated 2 years ago