lvyufeng / easy_mindspore_bk
☆18Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for easy_mindspore_bk
- mindspore implementation of transformers☆66Updated last year
- Must-read papers on improving efficiency for pre-trained language models.☆102Updated 2 years ago
- [AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models☆37Updated 10 months ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆97Updated last year
- [KDD'22] Learned Token Pruning for Transformers☆93Updated last year
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆77Updated 8 months ago
- The pure and clear PyTorch Distributed Training Framework.☆275Updated 9 months ago
- ☆14Updated last year
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆42Updated 3 years ago
- MindSpore implementations of Generative Adversarial Networks.☆21Updated 2 years ago
- A Tight-fisted Optimizer☆47Updated last year
- A light-weight script for maintaining a LOT of machine learning experiments.☆90Updated 2 years ago
- Grab GPU whenever available☆279Updated 2 years ago
- Python Scritpt which can be embedded into PyTorch model to print the model size.☆18Updated 3 years ago
- Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models☆184Updated 6 months ago
- Implementation of Denoising Diffusion Probabilistic Model in MindSpore☆32Updated last year
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆79Updated last year
- ☆14Updated last month
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆191Updated last year
- PaddlePaddle Code Convert Toolkit. 『飞桨』深度学习代码转换工具☆88Updated this week
- ☆74Updated 11 months ago
- Official PyTorch implementation of IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact☆32Updated 5 months ago
- The official implementation of the ICML 2023 paper OFQ-ViT☆27Updated last year
- Datasets, Transforms and Models specific to Computer Vision☆82Updated last year
- PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"☆232Updated last year
- PyTorch Dataset Rank Dataset☆40Updated 3 years ago
- Lion and Adam optimization comparison☆56Updated last year
- Ladder Side-Tuning在CLUE上的简单尝试☆19Updated 2 years ago
- Implementation of AAAI 2022 Paper: Go wider instead of deeper☆32Updated 2 years ago
- ☆155Updated last month