mindspore-lab / minddiffusionLinks

A collection of diffusion models based on MindSpore

☆161

Alternatives and similar repositories for minddiffusion

Users that are interested in minddiffusion are comparing it to the libraries listed below

Sorting:

kyegomez / NaViT
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
☆262Updated this week
bobo0810 / LearnDeepSpeed
DeepSpeed教程 & 示例注释 & 学习笔记（大模型高效训练）
☆179Updated 2 years ago
mindspore-lab / mindone
one for all, Optimal generator with No Exception
☆459Updated last week
ksOAn6g5 / TaiSu
TaiSu（太素）--a large-scale Chinese multimodal dataset（亿级大规模中文视觉语言预训练数据集）
☆191Updated last year
mindspore-lab / mindcv
A toolbox of vision models and algorithms based on MindSpore
☆261Updated 3 months ago
lichao-sun / SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision M…
☆501Updated last year
MFaceTech / AIGC-SD-Acceleration
☆24Updated last year
xxcheng0708 / pytorch-model-train-template
pytorch单精度、半精度、混合精度、单卡、多卡（DP / DDP）、FSDP、DeepSpeed模型训练代码，并对比不同方法的训练速度以及GPU内存的使用
☆122Updated last year
NUS-HPC-AI-Lab / InfoBatch
Lossless Training Speed Up by Unbiased Dynamic Data Pruning
☆342Updated last year
bojone / Keras-DDPM
生成扩散模型的Keras实现
☆315Updated 8 months ago
opendatalab / laion5b-downloader
☆116Updated 2 years ago
WGS-note / finetune_stable_diffusion
finetune stable diffusion with Dreambooth、LoRA、ControlNet
☆59Updated 2 years ago
Kwai-Kolors / MPS
☆192Updated last year
baofff / U-ViT
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
☆1,054Updated 2 years ago
zai-org / RelayDiffusion
The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]
☆310Updated last year
jy0205 / LaVIT
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
☆594Updated last year
alipay / Ant-Multi-Modal-Framework
Research Code for Multimodal-Cognition Team in Ant Group
☆169Updated 2 weeks ago
Meituan-AutoML / VisionLLaMA
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
☆389Updated last year
OvJat / DeepSpeedTutorial
DeepSpeed Tutorial
☆102Updated last year
hhaAndroid / awesome-mm-chat
多模态 MM +Chat 合集
☆276Updated 2 months ago
datawhalechina / sora-tutorial
☆103Updated last year
Victorwz / Open-Qwen2VL
[COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources
☆277Updated 2 months ago
xiaohu2015 / nngen
☆512Updated 2 years ago
tgxs002 / HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
☆604Updated last year
THUDM / SwissArmyTransformer
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
☆1,097Updated 10 months ago
ZGCTroy / LayoutDiffusion
diffusion-based layout-to-image generation model
☆318Updated 6 months ago
BIGBALLON / distribuuuu
The pure and clear PyTorch Distributed Training Framework.
☆274Updated last year
AlonzoLeeeooo / awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
☆686Updated last week
darcula1993 / diffusion-models-class-CN
Materials for the Hugging Face Diffusion Models Course
☆237Updated 2 years ago
thu-ml / unidiffuser
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
☆1,444Updated 2 years ago