hikvision-research / Unified-Normalization
# Unified Normalization (ACM MM'22)

By Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, and Shiliang Pu.

This repository is the official implementation of "Unified Normalization for Accelerating and Stabilizing Transformers".
☆35 · Updated last year
Related projects:
- An object detection codebase based on MegEngine. ☆28 · Updated last year
- An implementation of "Group Fisher Pruning for Practical Network Compression" based on PyTorch and mmcv ☆17 · Updated 2 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781 ☆71 · Updated 9 months ago
- RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization ☆13 · Updated 2 months ago
- IntLLaMA: A fast and light quantization solution for LLaMA ☆18 · Updated last year
- Slides with modifications for a course at Tsinghua University. ☆57 · Updated 2 years ago
- ☆13 · Updated last year
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning! ☆40 · Updated last year
- ☆34 · Updated 3 months ago
- ☆21 · Updated 5 months ago
- Code for RepNAS ☆13 · Updated 2 years ago
- ☆19 · Updated 3 years ago
- Official implementation for paper "DyRep: Bootstrapping Training with Dynamic Re-parameterization", CVPR 2022 ☆42 · Updated 2 years ago
- A codebase & model zoo for pretrained backbone based on MegEngine. ☆32 · Updated last year
- ☆20 · Updated last year
- PyTorch code for Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers ☆27 · Updated 2 weeks ago
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance ☆60 · Updated 2 months ago
- Keras implementation of Finite Scalar Quantization ☆58 · Updated 10 months ago
- This repo is the official MegEngine implementation of the ECCV 2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothin… ☆25 · Updated last year
- Official PyTorch implementation of Super Vision Transformer (IJCV) ☆42 · Updated last year
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete… ☆63 · Updated 3 weeks ago
- PyTorch implementation of SSQL (Accepted to ECCV2022 oral presentation) ☆75 · Updated last year
- [CVPR-2023] Towards Any Structural Pruning ☆17 · Updated last year
- BESA is a differentiable weight pruning technique for large language models. ☆12 · Updated 6 months ago
- ☆39 · Updated 10 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135 ☆15 · Updated last year
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments ☆13 · Updated last year
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan … ☆69 · Updated 2 years ago
- Distilling the powerful segment anything models into lightweight ones for efficient segmentation. ☆29 · Updated last year
- Useful dotfiles, including vim, zsh, tmux, and vscode ☆17 · Updated 3 weeks ago