lvyufeng / denoising-diffusion-mindspore
Implementation of Denoising Diffusion Probabilistic Model in MindSpore
☆36Updated 2 years ago
Alternatives and similar repositories for denoising-diffusion-mindspore
Users who are interested in denoising-diffusion-mindspore are comparing it to the libraries listed below
- DeepSpeed Tutorial☆97Updated 10 months ago
- A DiT-pytorch codebase, intended mainly for studying the DiT architecture.☆78Updated last year
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆43Updated 6 months ago
- ☆105Updated 11 months ago
- Efficient Mixture of Experts for LLM Paper List☆77Updated 6 months ago
- Lion and Adam optimization comparison☆61Updated 2 years ago
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆64Updated last month
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆52Updated 3 months ago
- ☆18Updated last year
- ☆21Updated last year
- ☆38Updated 11 months ago
- A demo of image classification with PyTorch DDP (DistributedDataParallel) and AMP (Automatic Mixed Precision) modules.☆68Updated last year
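- PyTorch model-training code covering single precision, half precision, mixed precision, single GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, with a comparison of training speed and GPU memory usage across the methods☆102Updated last year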
- [ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691)☆123Updated last year
- ☆18Updated 2 years ago
- ☆42Updated 4 months ago
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆106Updated 10 months ago
- LLM Tokenizer with BPE algorithm☆32Updated last year
- MindSpore implementations of Generative Adversarial Networks.☆22Updated 2 years ago
- Visualization: filter, feature map, attention map, image-mask, grad-cam, human keypoint, guided-backpro☆121Updated 2 years ago
- ☆62Updated 3 years ago
- GroupMixAttention and GroupMixFormer☆117Updated last year
- ☆24Updated last year
- ☆44Updated 2 years ago
- DeepSpeed tutorial, annotated examples, and study notes (efficient training of large models)☆169Updated last year
- (Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from …☆170Updated last year
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆32Updated 2 years ago
- A repository for DenseSSMs☆87Updated last year
- ☆137Updated last month
- ☆191Updated last year