Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers
☆26Apr 12, 2022Updated 3 years ago
Alternatives and similar repositories for beyond_masking
Users that are interested in beyond_masking are comparing it to the libraries listed below
- (SaGe) Semantic-Aware Generation for Self-Supervised Visual Representation Learning☆26Mar 29, 2022Updated 3 years ago
- Official repository for Fourier model that can generate periodic signals☆10Mar 10, 2022Updated 3 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Feb 21, 2022Updated 4 years ago
- Denoising Masked Autoencoders Help Robust Classification.☆67Jun 4, 2023Updated 2 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆12Aug 23, 2025Updated 6 months ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- Code for the paper "Understanding and Evaluating Racial Biases in Image Captioning"☆12Oct 19, 2021Updated 4 years ago
- ☆11Feb 1, 2024Updated 2 years ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim)☆89Jul 27, 2021Updated 4 years ago
- [CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.☆158Dec 6, 2022Updated 3 years ago
- Official code for paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" ICLR 2022 Spotlight☆185Nov 17, 2022Updated 3 years ago
- ECCV 2022, Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆97Nov 2, 2022Updated 3 years ago
- Code for You Only Cut Once: Boosting Data Augmentation with a Single Cut, ICML 2022.☆106Aug 1, 2023Updated 2 years ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆46Dec 1, 2024Updated last year
- Code of "NeuSample: Neural Sample Field for Efficient View Synthesis"☆37Oct 10, 2022Updated 3 years ago
- This is the official code repo for "RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?"☆38Dec 30, 2021Updated 4 years ago
- A general purpose web app for connecting participants to engage in realtime conversations based on generated prompts.☆20Jun 21, 2023Updated 2 years ago
- [CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"☆45Mar 25, 2025Updated 11 months ago
- An adaptive training algorithm for residual network☆17Aug 22, 2020Updated 5 years ago
- ☆41Sep 21, 2023Updated 2 years ago
- Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning☆20Feb 4, 2022Updated 4 years ago
- An mmcv logger hook that can push model results to WeChat☆21Oct 11, 2022Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆80Jan 7, 2026Updated 2 months ago
- [TMLR'24] This repository includes the official implementation of our paper "Unleashing the Power of Visual Prompting At the Pixel Level"☆42Apr 30, 2024Updated last year
- This is an official implementation of GRIT-VLP☆20Aug 8, 2022Updated 3 years ago
- 'Discretization-Aware Architecture Search' alleviates the discretization gap in one-shot differentiable NAS. DAAS has been accepted by PR…☆20Jul 30, 2021Updated 4 years ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches☆20Mar 31, 2023Updated 2 years ago
- Official Implementation of DE-CondDETR and DELA-CondDETR in "Towards Data-Efficient Detection Transformers"☆45Aug 25, 2022Updated 3 years ago
- Paper List for In-context Learning 🌷☆20Jan 3, 2023Updated 3 years ago
- Reading list for research topics in Masked Image Modeling☆338Dec 3, 2024Updated last year
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers☆21May 16, 2023Updated 2 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆29Jan 23, 2024Updated 2 years ago
- Paper Today I Read☆27Jan 27, 2026Updated last month
- ☆19Jan 2, 2023Updated 3 years ago
- This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"☆197Jan 11, 2023Updated 3 years ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆211Jul 28, 2024Updated last year
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆213Feb 27, 2024Updated 2 years ago
- PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)☆99May 2, 2022Updated 3 years ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago