lvyufeng / easy_mindspore_bk
☆18Updated 2 years ago
Alternatives and similar repositories for easy_mindspore_bk
Users who are interested in easy_mindspore_bk are comparing it to the libraries listed below.
- A Tight-fisted Optimizer☆48Updated 2 years ago
- [KDD'22] Learned Token Pruning for Transformers☆98Updated 2 years ago
- qwen-nsa☆71Updated 4 months ago
- [AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models☆57Updated last year
- Lion and Adam optimization comparison☆63Updated 2 years ago
- Transformer model based on the Gated Attention Unit (preview version)☆97Updated 2 years ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆86Updated 2 years ago
- Efficient Mixture of Experts for LLM Paper List☆87Updated 7 months ago
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆45Updated 2 months ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆39Updated last year
- ☆61Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆29Updated last year
- ☆13Updated 2 years ago
- The pure and clear PyTorch Distributed Training Framework.☆275Updated last year
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆46Updated last year
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆196Updated 2 years ago
- [ACL 2024] Official PyTorch implementation of "IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact"☆47Updated last year
- An object detection codebase based on MegEngine.☆28Updated 2 years ago
- (Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from …☆175Updated last year
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆72Updated 3 years ago
- [CVPR '23] PA&DA: Jointly Sampling PAth and DAta for Consistent NAS☆36Updated 2 years ago
- This project is the official implementation of our accepted ICLR 2022 paper BiBERT: Accurate Fully Binarized BERT.☆88Updated 2 years ago
- ☆21Updated 4 months ago
- [EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer☆61Updated 2 years ago
- ☆59Updated last year
- Datasets, Transforms and Models specific to Computer Vision☆87Updated last year
- ☆11Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆20Updated 2 months ago
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆54Updated last year
- Benchmarking Attention Mechanism in Vision Transformers.☆18Updated 2 years ago