aimagelab / MaPeT
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
☆16Updated 3 weeks ago
Alternatives and similar repositories for MaPeT:
Users that are interested in MaPeT are comparing it to the libraries listed below
- ☆52Updated last year
- Official Codes and Pretrained Models for RecursiveMix☆22Updated last year
- ☆18Updated 4 months ago
- Teach-DETR: Better Training DETR with Teachers☆30Updated 11 months ago
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆34Updated 2 years ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Updated last year
- [WACV 2024] Instruct Me More! Random Prompting for Visual In-Context Learning☆15Updated 10 months ago
- code base for vision transformers☆36Updated 3 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year
- Code of "What Images are More Memorable to Machines?"☆15Updated 2 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 2 years ago
- i-mae Pytorch Repo☆20Updated 10 months ago
- Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M…☆41Updated 5 months ago
- ☆43Updated last year
- Implementation for paper: Self-Regulation for Semantic Segmentation☆31Updated 3 years ago
- [CVPR 2022] Official PyTorch implementation for Attributable Visual Similarity Learning☆34Updated 2 years ago
- [CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation".☆58Updated last year
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆49Updated last month
- Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.☆20Updated 3 years ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- Code and models for the paper Glance-and-Gaze Vision Transformer☆28Updated 3 years ago
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency☆17Updated 3 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Updated 2 years ago
- [CVPR 2023] Bridging the Gap between Model Explanations in Partially Annotated Multi-label Classification☆21Updated last year
- ☆40Updated last year
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- Localized Vision-Language Matching for Open-vocabulary Object Detection☆20Updated 2 years ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping☆17Updated 2 years ago