VITA-Group / ViT-Anti-Oversmoothing
[ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wenqing Zheng, Tianlong Chen, Zhangyang Wang
☆80Updated last year
Alternatives and similar repositories for ViT-Anti-Oversmoothing:
Users that are interested in ViT-Anti-Oversmoothing are comparing it to the libraries listed below
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆48Updated 2 years ago
- Denoising Masked Autoencoders Help Robust Classification.☆62Updated last year
- open source the research work for published on arxiv. https://arxiv.org/abs/2106.02689☆51Updated 3 years ago
- Implementation of HAT https://arxiv.org/pdf/2204.00993☆49Updated 11 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆74Updated last year
- Repository containing code for blockwise SSL training☆28Updated 5 months ago
- ☆27Updated 2 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)☆24Updated 2 years ago
- (Unofficial) PyTorch implementation of the paper Early Convolutions Help Transformers See Better☆43Updated 3 years ago
- [ICDM 2023] Momentum is All You Need for Data-Driven Adaptive Optimization☆25Updated 11 months ago
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆50Updated 5 months ago
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders☆58Updated last year
- Hyperbolic Image Segmentation, CVPR 2022☆84Updated 2 years ago
- Information Bottleneck Approach to Spatial Attention Learning, IJCAI2021☆14Updated 3 years ago
- ☆42Updated 2 years ago
- ☆21Updated 2 years ago
- ☆54Updated last year
- [ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries☆29Updated 9 months ago
- Log-Polar Space Convolution for Convolutional Neural Networks☆12Updated 2 years ago
- Decoupled Kullback-Leibler Divergence Loss (DKL), NeurIPS 2024 / Generalized Kullback-Leibler Divergence Loss (GKL)☆42Updated this week
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 2 years ago
- A torch-based implementation of K-Means and K-Means++☆17Updated 4 years ago
- Official repository for "Orthogonal Projection Loss" (ICCV'21)☆120Updated 3 years ago
- ☆52Updated 2 years ago
- Recent Advances in MLP-based Models (MLP is all you need!)☆114Updated 2 years ago
- ☆25Updated 3 years ago
- Code for the ICML 2021 and ICLR 2022 papers: Skew Orthogonal Convolutions, Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100☆18Updated 3 years ago
- ResMLP: Feedforward networks for image classification with data-efficient training☆42Updated 3 years ago
- ☆19Updated 2 years ago