hikvision-research / Unified-Normalization
# Unified Normalization (ACM MM'22)
By Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, and Shiliang Pu. This repository is the official implementation of "Unified Normalization for Accelerating and Stabilizing Transformers".
☆34 Updated last year
Related projects
Alternatives and complementary repositories for Unified-Normalization
- An object detection codebase based on MegEngine. ☆28 Updated last year
- ☆19 Updated 3 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781 ☆71 Updated last year
- Slides with modifications for a course at Tsinghua University. ☆57 Updated 2 years ago
- ☆13 Updated last year
- Code for RepNAS ☆13 Updated 2 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA ☆18 Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod… ☆36 Updated 8 months ago
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning! ☆40 Updated last year
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa… ☆76 Updated 2 years ago
- ☆21 Updated 3 weeks ago
- Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024). ☆20 Updated 5 months ago
- A codebase & model zoo for pretrained backbone based on MegEngine. ☆32 Updated last year
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments ☆13 Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod… ☆19 Updated 8 months ago
- [TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection" ☆29 Updated 3 months ago
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation" ☆16 Updated 4 months ago
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance ☆66 Updated 4 months ago
- BESA is a differentiable weight pruning technique for large language models. ☆14 Updated 8 months ago
- TVMScript kernel for deformable attention ☆24 Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity" ☆66 Updated 2 years ago
- PyTorch implementation of SSQL (Accepted to ECCV2022 oral presentation) ☆75 Updated last year
- Benchmarking Attention Mechanism in Vision Transformers. ☆16 Updated 2 years ago
- A converter from MegEngine to other frameworks ☆67 Updated last year
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin… ☆40 Updated last year
- ☆20 Updated 2 years ago
- Official PyTorch implementation of Super Vision Transformer (IJCV) ☆42 Updated last year
- MegEngine implementation of Diffusion Models. ☆16 Updated 2 years ago
- Batch Normalization Auto-fusion for PyTorch ☆32 Updated 4 years ago
- ☆17 Updated 2 years ago