[CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction"
☆70Jul 2, 2025Updated 7 months ago
Alternatives and similar repositories for maskalign
Users that are interested in maskalign are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…☆84Aug 16, 2022Updated 3 years ago
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining☆107Apr 16, 2025Updated 10 months ago
- The multi-view version of MonoDETR on nuScenes dataset☆20Nov 4, 2022Updated 3 years ago
- Official codes for ConMIM (ICLR 2023)☆58Feb 8, 2023Updated 3 years ago
- ☆19Sep 24, 2024Updated last year
- ☆59Jun 17, 2022Updated 3 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆524Mar 14, 2023Updated 2 years ago
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024.☆13Feb 28, 2024Updated 2 years ago
- The official code for the paper Evolved Part Masking for Self-Supervised Learning.☆16Jun 14, 2023Updated 2 years ago
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆146Jul 2, 2023Updated 2 years ago
- Denoising Masked Autoencoders Help Robust Classification.☆67Jun 4, 2023Updated 2 years ago
- A self-supervised learning approach based on extremely large masking☆31Dec 19, 2022Updated 3 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆57Jan 17, 2024Updated 2 years ago
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders☆42Jun 18, 2023Updated 2 years ago
- ☆18Aug 23, 2022Updated 3 years ago
- ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆97Nov 2, 2022Updated 3 years ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago
- Benchmarking Attention Mechanism in Vision Transformers.☆20Oct 10, 2022Updated 3 years ago
- PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆39Jan 10, 2023Updated 3 years ago
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o…☆39Dec 29, 2022Updated 3 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,024Sep 29, 2022Updated 3 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆29Jan 23, 2024Updated 2 years ago
- Pytorch code for Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency (NeurIPS 2022)☆20Oct 10, 2022Updated 3 years ago
- ☆72Mar 10, 2025Updated 11 months ago
- Official PyTorch Implementation of Exploring Stochastic Autoregressive Image Modeling for Visual Representation, Accepted by AAAI 2023.☆16Jul 3, 2023Updated 2 years ago
- ☆46Feb 23, 2023Updated 3 years ago
- The code for On Robust Cross-View Consistency in Outdoor Self-Supervised Monocular Depth Estimation☆13Jun 2, 2023Updated 2 years ago
- [NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.☆177Jan 16, 2023Updated 3 years ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆109Jul 24, 2023Updated 2 years ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆32Dec 7, 2023Updated 2 years ago
- [ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling☆74Apr 18, 2024Updated last year
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- [AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification☆12Mar 10, 2025Updated 11 months ago
- [ICCV 2025] Nexus: Decoupled Diffusion Sparks Adaptive Scene Generation☆110Jan 6, 2026Updated last month
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆245Dec 3, 2022Updated 3 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆29Nov 14, 2022Updated 3 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆55Jul 10, 2023Updated 2 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆56May 10, 2025Updated 9 months ago
- ☆30Feb 15, 2026Updated last week