microsoft / ExtreMA
A self-supervised learning approach based on extremely large masking
☆29Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ExtreMA
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆51Updated last year
- ☆44Updated 3 years ago
- ☆53Updated last year
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆99Updated 2 years ago
- Official codes for ConMIM (ICLR 2023)☆57Updated last year
- ☆34Updated last year
- Example code for OCDA-Driving☆15Updated 3 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 2 years ago
- GroupViT: Semantic Segmentation Emerges from Text Supervision☆25Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆23Updated 9 months ago
- MIST: Multiple Instance Spatial Transformer☆25Updated 3 years ago
- Code and models for ICCV2021 paper "Robust Object Detection via Instance-Level Temporal Cycle Confusion".☆75Updated last year
- A Unified Framework for Video-Language Understanding☆55Updated last year
- REACT (CVPR 2023, Highlight 2.5%)☆134Updated last year
- PyTorch Implementation of Region Similarity Representation Learning (ReSim)☆86Updated 3 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆64Updated 2 years ago
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆32Updated last year
- Parametric Instance Classification for Unsupervised Visual Feature Learning, NeurIPS 2020☆51Updated 3 years ago
- DA-AIM: Exploiting Instance-based Mixed Sampling via Auxiliary Source Domain Supervision for Domain-adaptive Action Detection☆12Updated 2 years ago
- ☆18Updated last year
- Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning☆20Updated 2 years ago
- Compress conventional Vision-Language Pre-training data☆49Updated last year
- This is a offical PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- [CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)☆52Updated 2 years ago
- The code for paper "Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation" which is accepted by AAAI 2022☆10Updated 2 years ago
- Paper List for In-context Learning 🌷☆20Updated last year
- UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)☆84Updated last year
- Edit and Generate Anything in 3D world!☆11Updated last year
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆31Updated last year
- [CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"☆51Updated last year