aimagelab / MaPeT
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
☆15 Updated last year
Related projects
Alternatives and complementary repositories for MaPeT
- ☆18 Updated last month
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639) ☆32 Updated last year
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter". ☆16 Updated last year
- ☆52 Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference ☆65 Updated 3 months ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning ☆30 Updated 11 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)" ☆28 Updated 5 months ago
- Code of "What Images are More Memorable to Machines?" ☆15 Updated last year
- Teach-DETR: Better Training DETR with Teachers ☆29 Updated 8 months ago
- Localized Vision-Language Matching for Open-vocabulary Object Detection ☆19 Updated 2 years ago
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing ☆11 Updated last week
- ☆19 Updated 3 months ago
- ☆39 Updated last year
- Official PyTorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023] ☆47 Updated last year
- ☆44 Updated last year
- ☆16 Updated last year
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models ☆34 Updated 7 months ago
- code base for vision transformers ☆36 Updated 2 years ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024) ☆36 Updated 2 weeks ago
- Official Codes and Pretrained Models for RecursiveMix ☆22 Updated last year
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning ☆56 Updated 3 months ago
- [WACV 2024] Instruct Me More! Random Prompting for Visual In-Context Learning ☆14 Updated 7 months ago
- Official code for BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model ☆13 Updated 5 months ago
- [CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation". ☆56 Updated last year
- ☆21 Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation ☆35 Updated last year
- official repo for the paper "EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata" ☆42 Updated last year
- SOIT: Segmenting Objects with Instance-Aware Transformers ☆14 Updated 2 years ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral) ☆25 Updated 6 months ago
- ☆13 Updated 7 months ago