hikvision-research / Unified-Normalization
# Unified Normalization (ACM MM'22) By Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, and Shiliang Pu. This repository is the official implementation of "Unified Normalization for Accelerating and Stabilizing Transformers"
☆34Updated last year
Alternatives and similar repositories for Unified-Normalization:
Users that are interested in Unified-Normalization are comparing it to the libraries listed below
- An object detection codebase based on MegEngine.☆28Updated 2 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year
- ☆13Updated last year
- Code for RepNAS☆13Updated 3 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆74Updated last year
- [TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"☆30Updated 6 months ago
- A codebase & model zoo for pretrained backbone based on MegEngine.☆33Updated last year
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Updated 3 years ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆41Updated last year
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆19Updated 2 months ago
- ☆21Updated 3 months ago
- ☆20Updated 2 years ago
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!☆40Updated last year
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Updated last year
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆70Updated 2 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆71Updated 2 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated last year
- ☆34Updated last year
- Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction☆10Updated 2 years ago
- This repo is the official megengine implementation of the ECCV2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothin…☆26Updated 2 years ago
- BESA is a differentiable weight pruning technique for large language models.☆14Updated 11 months ago
- Pytorch implementation of RAPQ, IJCAI 2022☆21Updated last year
- TVMScript kernel for deformable attention☆24Updated 3 years ago
- ☆19Updated 4 years ago
- Triton implement of bi-directional (non-causal) linear attention☆41Updated 2 weeks ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2021 -- Network Pruning using Adaptive Exemplar Filters☆22Updated 3 years ago
- Slides with modifications for a course at Tsinghua University.☆58Updated 2 years ago
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML'24)☆29Updated 6 months ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆41Updated 4 months ago