VITA-Group / ViT-Anti-Oversmoothing
[ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wenqing Zheng, Tianlong Chen, Zhangyang Wang
☆80Updated last year
Alternatives and similar repositories for ViT-Anti-Oversmoothing:
Users who are interested in ViT-Anti-Oversmoothing are comparing it to the repositories listed below
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆48Updated 2 years ago
- Open-source code for the research work published on arXiv: https://arxiv.org/abs/2106.02689☆51Updated 3 years ago
- This is an official PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- Denoising Masked Autoencoders Help Robust Classification.☆62Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆74Updated last year
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"☆55Updated last year
- Hyperbolic Image Segmentation, CVPR 2022☆85Updated 2 years ago
- ☆54Updated last year
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆71Updated 2 years ago
- Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)☆29Updated last year
- ResMLP: Feedforward networks for image classification with data-efficient training☆42Updated 3 years ago
- Implementation of HAT https://arxiv.org/pdf/2204.00993☆49Updated last year
- Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023☆82Updated 2 months ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 2 years ago
- ☆27Updated 2 years ago
- ☆43Updated 2 years ago
- Repository containing code for blockwise SSL training☆29Updated 6 months ago
- (Unofficial) PyTorch implementation of the paper Early Convolutions Help Transformers See Better☆43Updated 3 years ago
- Official Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data☆58Updated 9 months ago
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)☆24Updated 2 years ago
- [CVPR 2024] VkD: Improving Knowledge Distillation using Orthogonal Projections☆52Updated 5 months ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆53Updated last year
- Official code for ICCV 2023 paper "Convolutional Networks with Oriented 1D Kernels"☆47Updated last year
- [CVPR'23] Hard Patches Mining for Masked Image Modeling☆91Updated last year
- Information Bottleneck Approach to Spatial Attention Learning, IJCAI2021☆15Updated 3 years ago
- Transformers w/o Attention, based fully on MLPs☆92Updated last year
- [ICDM 2023] Momentum is All You Need for Data-Driven Adaptive Optimization☆25Updated last year
- ☆61Updated 2 years ago
- PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes☆60Updated 11 months ago
- Log-Polar Space Convolution for Convolutional Neural Networks☆12Updated 2 years ago