dsb-ifi / SPiT
A Spitting Image: Modular Superpixel Tokenization in Vision Transformers
☆19Updated 4 months ago
Alternatives and similar repositories for SPiT:
Users that are interested in SPiT are comparing it to the libraries listed below
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆20Updated 5 months ago
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing☆29Updated 9 months ago
- Official code for "DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut", NeurIPS 202…☆37Updated 2 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆18Updated 5 months ago
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆13Updated 3 weeks ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆31Updated 4 months ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆37Updated last year
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆37Updated 2 months ago
- DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection☆20Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year
- ☆27Updated last year
- MIMIC: Masked Image Modeling with Image Correspondences☆16Updated 9 months ago
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation"☆15Updated last week
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement☆38Updated last month
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆45Updated 2 weeks ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- Official implementation of the WACV 2024 paper CLIP-DIY☆34Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆79Updated 2 weeks ago
- ☆21Updated 8 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆75Updated this week
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆37Updated 9 months ago
- PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation"☆48Updated 6 months ago
- ☆30Updated 6 months ago
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation☆26Updated 10 months ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆75Updated 4 months ago
- Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]☆40Updated 2 weeks ago
- [CVPR 2024 Highlight] ImageNet-D☆41Updated 5 months ago
- This repository is for the first survey on SAM & SAM2 for Videos.☆37Updated this week
- Open-vocabulary Semantic Segmentation☆34Updated last year