microsoft / ExtreMA
A self-supervised learning approach based on extremely large masking
☆30 Updated 2 years ago
Alternatives and similar repositories for ExtreMA:
Users who are interested in ExtreMA are comparing it to the repositories listed below:
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222 ☆51 Updated last year
- Open-source code for Generic Grouping Network (GGN, CVPR 2022) ☆110 Updated last month
- code release of research paper "Exploring Long-Sequence Masked Autoencoders" ☆99 Updated 2 years ago
- a novel data augmentation method across data modalities ☆72 Updated last year
- ☆52 Updated 2 years ago
- On-Device Domain Generalization ☆41 Updated 2 years ago
- Code release for the CVPR'23 paper titled "PartDistillation: Learning Parts from Instance Segmentation" ☆58 Updated last year
- ☆59 Updated last year
- GroupViT: Semantic Segmentation Emerges from Text Supervision ☆25 Updated 2 years ago
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639) ☆35 Updated 2 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones". ☆27 Updated last year
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding ☆75 Updated last year
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors ☆25 Updated 9 months ago
- [ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video ☆19 Updated last year
- A Unified Framework for Video-Language Understanding ☆57 Updated last year
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter". ☆16 Updated last year
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22 ☆41 Updated 2 years ago
- ☆12 Updated 3 years ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners" ☆28 Updated 11 months ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering" ☆18 Updated 2 years ago
- Official codes for ConMIM (ICLR 2023) ☆58 Updated 2 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images ☆58 Updated 3 years ago
- PyTorch implementation of R-MAE https://arxiv.org/abs/2306.05411 ☆112 Updated last year
- ☆44 Updated 3 years ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023] ☆50 Updated 2 months ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ☆63 Updated 2 years ago
- Edit and Generate Anything in 3D world! ☆13 Updated last year
- [ECCV2022] Dense Siamese Network for Dense Unsupervised Learning ☆28 Updated 2 years ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs ☆25 Updated 2 months ago
- [CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation". ☆58 Updated 2 years ago