JeongsooP / RGB-no-more
An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers
☆54Updated last year
Alternatives and similar repositories for RGB-no-more:
Users that are interested in RGB-no-more are comparing it to the libraries listed below
- A simple minimal implementation of Reversible Vision Transformers☆120Updated 10 months ago
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"☆105Updated 10 months ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆60Updated 9 months ago
- Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022)☆152Updated 2 years ago
- ☆49Updated last year
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆81Updated 6 months ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆103Updated last year
- This is a offical PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- IFSeg: Image-free Semantic Segmentation via Vision-Language Model (CVPR 2023)☆84Updated last year
- (ICML 2022) Official PyTorch implementation of “Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Rob…☆78Updated 2 years ago
- ☆52Updated last year
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆52Updated 5 months ago
- Augmenting with Language-guided Image Augmentation (ALIA)☆70Updated last year
- Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"☆98Updated last year
- PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411☆109Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆68Updated last year
- A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework".☆83Updated 11 months ago
- [ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation☆99Updated last year
- A compilation of network architectures for vision and others without usage of self-attention mechanism☆77Updated 2 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆62Updated 2 years ago
- Official codes for ConMIM (ICLR 2023)☆58Updated last year
- [ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆92Updated last year
- ☆33Updated 2 years ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆99Updated last month
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆98Updated 2 years ago
- Python code for ICLR 2022 spotlight paper EViT: Expediting Vision Transformers via Token Reorganizations☆175Updated last year
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆78Updated last year
- The official github repo for "Test-Time Training with Masked Autoencoders"☆80Updated last year
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆33Updated last year
- Official code for ICCV 2023 paper "Convolutional Networks with Oriented 1D Kernels"☆46Updated last year