lvyufeng / denoising-diffusion-mindsporeLinks
Implementation of Denoising Diffusion Probabilistic Model in MindSpore
☆39Updated 2 years ago
Alternatives and similar repositories for denoising-diffusion-mindspore
Users that are interested in denoising-diffusion-mindspore are comparing it to the libraries listed below
Sorting:
- Lion and Adam optimization comparison☆61Updated 2 years ago
- DeepSpeed Tutorial☆98Updated 11 months ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆112Updated last year
- 《动手学深度学习》的MindSpore实现。供MindSpore学习者配合李沐老师课程使用。☆118Updated last year
- 这是一个DiT-pytorch的代码,主要用于学习DiT结构。☆78Updated last year
- ☆67Updated 2 years ago
- Efficient Mixture of Experts for LLM Paper List☆79Updated 7 months ago
- deep learning template code☆66Updated last year
- Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficien…☆110Updated 3 months ago
- A demo of image classification with PyTorch DDP (DistributedDataParallel) and AMP (Automatic Mixed Precision) modules.☆69Updated last year
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆169Updated last year
- ☆108Updated last year
- A Tight-fisted Optimizer☆48Updated 2 years ago
- ☆74Updated last year
- ☆39Updated 11 months ago
- ☆40Updated last year
- LLM Tokenizer with BPE algorithm☆32Updated last year
- ☆23Updated 2 years ago
- 一个用于学习的仿Pytorch纯Python实现的自动求导工具。☆51Updated last year
- PaddlePaddle Code Convert Toolkit. 『飞桨』深度学习代码转换工具☆106Updated last week
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆148Updated 3 weeks ago
- ☆42Updated 5 months ago
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆43Updated 7 months ago
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆106Updated 10 months ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆55Updated last week
- ☆195Updated last year
- ☆147Updated 10 months ago
- ☆24Updated last year
- ☆202Updated 8 months ago
- ☆167Updated this week