microsoft / ExtreMA
A self-supervised learning approach based on extremely large masking
☆30 Updated 2 years ago
Alternatives and similar repositories for ExtreMA:
Users who are interested in ExtreMA are comparing it to the libraries listed below:
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222 ☆51 Updated last year
- code release of research paper "Exploring Long-Sequence Masked Autoencoders" ☆99 Updated 2 years ago
- ☆28 Updated 3 years ago
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding ☆75 Updated last year
- Paper List for In-context Learning 🌷 ☆20 Updated 2 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022) ☆109 Updated 2 months ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images ☆58 Updated 3 years ago
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639) ☆34 Updated 2 years ago
- PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613) ☆100 Updated 2 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022) ☆84 Updated 2 years ago
- Code release for the CVPR'23 paper titled "PartDistillation: Learning Parts from Instance Segmentation" ☆59 Updated last year
- ScaleNet: Searching for the Model to Scale (ECCV 2022) ☆12 Updated 2 years ago
- SOIT: Segmenting Objects with Instance-Aware Transformers ☆14 Updated 2 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering" ☆18 Updated 2 years ago
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described … ☆68 Updated 3 years ago
- [ECCV2022] Dense Siamese Network for Dense Unsupervised Learning ☆28 Updated 2 years ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023] ☆49 Updated last month
- ☆12 Updated 3 years ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter". ☆16 Updated last year
- Temporal Pyramid Routing for Video Instance Segmentation (T-PAMI 2022) ☆25 Updated last year
- Official codes for ConMIM (ICLR 2023) ☆58 Updated 2 years ago
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning ☆34 Updated 2 years ago
- ☆42 Updated 2 years ago
- Example code for OCDA-Driving ☆15 Updated 4 years ago
- [CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations" ☆52 Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones". ☆27 Updated last year
- Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning ☆20 Updated 3 years ago
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors ☆25 Updated 8 months ago
- ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations wh… ☆24 Updated 3 years ago
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction" ☆69 Updated last year