hikvision-research / Unified-Normalization
# Unified Normalization (ACM MM'22) By Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, and Shiliang Pu. This repository is the official implementation of "Unified Normalization for Accelerating and Stabilizing Transformers"
☆34Updated 2 years ago
Alternatives and similar repositories for Unified-Normalization:
Users that are interested in Unified-Normalization are comparing it to the libraries listed below
- An object detection codebase based on MegEngine.☆28Updated 2 years ago
- A codebase & model zoo for pretrained backbone based on MegEngine.☆33Updated 2 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year
- ☆13Updated 2 years ago
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Updated 3 years ago
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!☆40Updated last year
- This repo is the official megengine implementation of the ECCV2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothin…☆26Updated 2 years ago
- Code for RepNAS☆13Updated 3 years ago
- [TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"☆30Updated 7 months ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆75Updated last year
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]☆40Updated this week
- ☆19Updated 4 years ago
- ☆20Updated 2 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆42Updated 6 months ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆26Updated last year
- Slides with modifications for a course at Tsinghua University.☆59Updated 2 years ago
- ☆22Updated 5 months ago
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆19Updated 3 months ago
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆40Updated 2 years ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆66Updated 8 months ago
- PyTorch implementation of SSQL (Accepted to ECCV2022 oral presentation)☆75Updated 2 years ago
- TVMScript kernel for deformable attention☆25Updated 3 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆71Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆71Updated 2 years ago
- https://hyperbox-doc.readthedocs.io/en/latest/☆25Updated last year
- BESA is a differentiable weight pruning technique for large language models.☆14Updated last year
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 2 years ago
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML'24)☆29Updated 7 months ago