VITA-Group / ViT-Anti-Oversmoothing
[ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wenqing Zheng, Tianlong Chen, Zhangyang Wang
☆80 Updated last year
Alternatives and similar repositories for ViT-Anti-Oversmoothing
Users who are interested in ViT-Anti-Oversmoothing compare it to the repositories listed below.
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model" ☆49 Updated 2 years ago
- Denoising Masked Autoencoders Help Robust Classification. ☆64 Updated 2 years ago
- Repository containing code for blockwise SSL training ☆29 Updated 8 months ago
- Open-source code for the research work published on arXiv: https://arxiv.org/abs/2106.02689 ☆51 Updated 3 years ago
- Implementation of HAT https://arxiv.org/pdf/2204.00993 ☆50 Updated last year
- ☆27 Updated 2 years ago
- This is an official PyTorch/GPU implementation of SupMAE. ☆78 Updated 2 years ago
- Unofficial PyTorch implementation of the paper "Generating images with sparse representations" ☆38 Updated 4 years ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training ☆76 Updated 2 years ago
- [ICDM 2023] Momentum is All You Need for Data-Driven Adaptive Optimization ☆25 Updated last year
- Code for the paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf) ☆24 Updated 2 years ago
- [CVPR 2024] VkD: Improving Knowledge Distillation using Orthogonal Projections ☆53 Updated 8 months ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction) ☆64 Updated 2 months ago
- Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023 ☆84 Updated 4 months ago
- (Unofficial) PyTorch implementation of the paper Early Convolutions Help Transformers See Better ☆43 Updated 3 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity" ☆72 Updated 2 years ago
- Frequency Augmented Variational Autoencoder for better Image Reconstruction ☆40 Updated last year
- Transformers w/o Attention, based fully on MLPs ☆93 Updated last year
- ☆25 Updated 3 years ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%) ☆24 Updated last year
- ☆54 Updated last year
- ☆19 Updated 2 years ago
- An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers ☆55 Updated last year
- [ICLR 2024] Official code for the paper 'Elucidating the Exposure Bias in Diffusion Models' ☆26 Updated last year
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay. ☆59 Updated last year
- ☆61 Updated 2 years ago
- PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes ☆62 Updated last year
- Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer) ☆29 Updated last year
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models" ☆56 Updated 2 years ago
- ☆16 Updated last year