Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
☆245Dec 3, 2022Updated 3 years ago
Alternatives and similar repositories for UM-MAE
Users that are interested in UM-MAE are comparing it to the libraries listed below
Sorting:
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆524Mar 14, 2023Updated 2 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆343Oct 21, 2023Updated 2 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,024Sep 29, 2022Updated 3 years ago
- Reading list for research topics in Masked Image Modeling☆338Dec 3, 2024Updated last year
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago
- ☆59Jun 17, 2022Updated 3 years ago
- (CVPR2023)Dense Distinct Query for End-to-End Object Detection☆264May 24, 2023Updated 2 years ago
- Official Codes and Pretrained Models for RecursiveMix☆22Apr 24, 2023Updated 2 years ago
- This repo holds the research projects of our lab.☆12Jan 20, 2024Updated 2 years ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆68Oct 11, 2022Updated 3 years ago
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆146Jul 2, 2023Updated 2 years ago
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆463May 9, 2022Updated 3 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆80Aug 30, 2022Updated 3 years ago
- Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)☆199Aug 24, 2022Updated 3 years ago
- [CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation☆156Aug 19, 2023Updated 2 years ago
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆615Dec 13, 2022Updated 3 years ago
- BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training☆400Oct 23, 2024Updated last year
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆198Jan 11, 2023Updated 3 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆85Nov 2, 2022Updated 3 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- reproduction of semantic segmentation using masked autoencoder (mae)☆170Feb 3, 2022Updated 4 years ago
- UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)☆89Jun 12, 2023Updated 2 years ago
- [CVPR 2022 Oral] AdaMixer: A Fast-Converging Query-Based Object Detector☆237Aug 17, 2022Updated 3 years ago
- [NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.☆177Jan 16, 2023Updated 3 years ago
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"☆379Sep 16, 2022Updated 3 years ago
- This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org…☆400May 22, 2023Updated 2 years ago
- "SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.☆200Apr 17, 2022Updated 3 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆57Jan 17, 2024Updated 2 years ago
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆8,230Jul 23, 2024Updated last year
- Official implementation of PVT series☆1,887Oct 27, 2022Updated 3 years ago
- iBOT : Image BERT Pre-Training with Online Tokenizer (ICLR 2022)☆765Apr 14, 2022Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆291Apr 25, 2022Updated 3 years ago
- Replication of Pix2Seq with Pretrained Model☆59Nov 6, 2021Updated 4 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Feb 21, 2022Updated 4 years ago
- Denoising Masked Autoencoders Help Robust Classification.☆67Jun 4, 2023Updated 2 years ago
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…☆84Aug 16, 2022Updated 3 years ago
- Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)☆138May 24, 2023Updated 2 years ago
- Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.☆782May 10, 2022Updated 3 years ago
- Official PyTorch implementation of Fully Attentional Networks☆482Mar 31, 2023Updated 2 years ago