implus/UM-MAE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/implus/UM-MAE)

implus / UM-MAE

Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"

☆245

Alternatives and similar repositories for UM-MAE

Users that are interested in UM-MAE are comparing it to the libraries listed below

Sorting:

Alpha-VL / ConvMAE
View on GitHub
ConvMAE: Masked Convolution Meets Masked Autoencoders
☆524Mar 14, 2023Updated 2 years ago
hustvl / MIMDet
View on GitHub
[ICCV 2023] You Only Look at One Partial Sequence
☆343Oct 21, 2023Updated 2 years ago
microsoft / SimMIM
View on GitHub
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,024Sep 29, 2022Updated 3 years ago
ucasligang / awesome-MIM
View on GitHub
Reading list for research topics in Masked Image Modeling
☆338Dec 3, 2024Updated last year
facebookresearch / long_seq_mae
View on GitHub
code release of research paper "Exploring Long-Sequence Masked Autoencoders"
☆100Oct 14, 2022Updated 3 years ago
Alpha-VL / FastConvMAE
View on GitHub
☆59Jun 17, 2022Updated 3 years ago
jshilong / DDQ
View on GitHub
(CVPR2023)Dense Distinct Query for End-to-End Object Detection
☆264May 24, 2023Updated 2 years ago
implus / RecursiveMix-pytorch
View on GitHub
Official Codes and Pretrained Models for RecursiveMix
☆22Apr 24, 2023Updated 2 years ago
IMPlus-PCALab / Research
View on GitHub
This repo holds the research projects of our lab.
☆12Jan 20, 2024Updated 2 years ago
ucasligang / SimViT
View on GitHub
[ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.
☆68Oct 11, 2022Updated 3 years ago
Sense-X / MixMIM
View on GitHub
MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning
☆146Jul 2, 2023Updated 2 years ago
facebookresearch / msn
View on GitHub
Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
☆463May 9, 2022Updated 3 years ago
enyac-group / supmae
View on GitHub
This is a offical PyTorch/GPU implementation of SupMAE.
☆80Aug 30, 2022Updated 3 years ago
JIA-Lab-research / SA-AutoAug
View on GitHub
Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)
☆199Aug 24, 2022Updated 3 years ago
lxtGH / Video-K-Net
View on GitHub
[CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
☆156Aug 19, 2023Updated 2 years ago
EPFL-VILAB / MultiMAE
View on GitHub
MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022
☆615Dec 13, 2022Updated 3 years ago
amazon-science / bigdetection
View on GitHub
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
☆400Oct 23, 2024Updated last year
lxtGH / CAE
View on GitHub
This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"
☆198Jan 11, 2023Updated 3 years ago
CASIA-LMC-Lab / Obj2Seq
View on GitHub
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)
☆85Nov 2, 2022Updated 3 years ago
facebookresearch / SLIP
View on GitHub
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆787Feb 9, 2023Updated 3 years ago
implus / mae_segmentation
View on GitHub
reproduction of semantic segmentation using masked autoencoder (mae)
☆170Feb 3, 2022Updated 4 years ago
microsoft / UniTAB
View on GitHub
UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)
☆89Jun 12, 2023Updated 2 years ago
MCG-NJU / AdaMixer
View on GitHub
[CVPR 2022 Oral] AdaMixer: A Fast-Converging Query-Based Object Detector
☆237Aug 17, 2022Updated 3 years ago
LayneH / GreenMIM
View on GitHub
[NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.
☆177Jan 16, 2023Updated 3 years ago
ShoufaChen / AdaptFormer
View on GitHub
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
☆379Sep 16, 2022Updated 3 years ago
Atten4Vis / ConditionalDETR
View on GitHub
This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org…
☆400May 22, 2023Updated 2 years ago
megvii-research / SOLQ
View on GitHub
"SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.
☆200Apr 17, 2022Updated 3 years ago
liuxingbin / dbot
View on GitHub
[ICLR2024] Exploring Target Representations for Masked Autoencoders
☆57Jan 17, 2024Updated 2 years ago
facebookresearch / mae
View on GitHub
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
☆8,230Jul 23, 2024Updated last year
whai362 / PVT
View on GitHub
Official implementation of PVT series
☆1,887Oct 27, 2022Updated 3 years ago
bytedance / ibot
View on GitHub
iBOT : Image BERT Pre-Training with Online Tokenizer (ICLR 2022)
☆765Apr 14, 2022Updated 3 years ago
ShoufaChen / CycleMLP
View on GitHub
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
☆291Apr 25, 2022Updated 3 years ago
gaopengcuhk / Pretrained-Pix2Seq
View on GitHub
Replication of Pix2Seq with Pretrained Model
☆59Nov 6, 2021Updated 4 years ago
VITA-Group / AsViT
View on GitHub
[ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…
☆76Feb 21, 2022Updated 4 years ago
quanlin-wu / dmae
View on GitHub
Denoising Masked Autoencoders Help Robust Classification.
☆67Jun 4, 2023Updated 2 years ago
zejiangh / MILAN
View on GitHub
PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…
☆84Aug 16, 2022Updated 3 years ago
jshilong / GroupRCNN
View on GitHub
Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)
☆138May 24, 2023Updated 2 years ago
NVlabs / GroupViT
View on GitHub
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
☆782May 10, 2022Updated 3 years ago
NVlabs / FAN
View on GitHub
Official PyTorch implementation of Fully Attentional Networks
☆482Mar 31, 2023Updated 2 years ago