lvyufeng / easy_mindspore
☆18Updated last year
Related projects: ⓘ
- mindspore implementation of transformers☆65Updated last year
- [KDD'22] Learned Token Pruning for Transformers☆91Updated last year
- ☆59Updated 2 months ago
- [AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models☆34Updated 8 months ago
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆45Updated 9 months ago
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆77Updated last year
- The official implementation of the ICML 2023 paper OFQ-ViT☆27Updated 11 months ago
- A Tight-fisted Optimizer☆46Updated last year
- Implementation of AAAI 2022 Paper: Go wider instead of deeper☆32Updated last year
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆78Updated last year
- Must-read papers on improving efficiency for pre-trained language models.☆100Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆33Updated 6 months ago
- Vision Transformer Pruning☆52Updated 2 years ago
- ☆18Updated 9 months ago
- A light-weight script for maintaining a LOT of machine learning experiments.☆88Updated last year
- Implementation of Denoising Diffusion Probabilistic Model in MindSpore☆30Updated last year
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆38Updated 10 months ago
- This project is the official implementation of our accepted IEEE TPAMI paper Diverse Sample Generation: Pushing the Limit of Data-free Qu…☆14Updated last year
- Pytorch implementation of TPAMI 2022 -- 1xN Pattern for Pruning Convolutional Neural Networks☆43Updated 2 years ago
- Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer☆69Updated 2 years ago
- ☆15Updated last year
- [NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang…☆90Updated 9 months ago
- This project is the official implementation of our accepted ICLR 2022 paper BiBERT: Accurate Fully Binarized BERT.☆81Updated last year
- ☆33Updated 2 years ago
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆40Updated 3 years ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆69Updated 6 months ago
- Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference"☆61Updated 2 months ago
- Post-Training Quantization for Vision transformers.☆176Updated 2 years ago
- Python Scritpt which can be embedded into PyTorch model to print the model size.☆18Updated 3 years ago
- ☆16Updated 2 years ago