mt-cly / SimCMFLinks
SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality
☆34Updated 9 months ago
Alternatives and similar repositories for SimCMF
Users that are interested in SimCMF are comparing it to the libraries listed below
Sorting:
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆27Updated 10 months ago
- [TPAMI 2023] Code for inference of our TPAMI and ECCV papers on model-guided disentanglement for GANs.☆27Updated 2 years ago
- ☆31Updated last year
- ☆32Updated last year
- ☆30Updated last year
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆48Updated last month
- FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation (ICCV 2023)☆23Updated last year
- ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models☆18Updated last year
- ☆39Updated last year
- MIMIC: Masked Image Modeling with Image Correspondences☆16Updated last year
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆48Updated 3 weeks ago
- [CVPR'2025] EntitySAM: Segment Everything in Video☆41Updated last month
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆28Updated last year
- This repository is for the first survey on SAM & SAM2 for Videos.☆52Updated 4 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆43Updated last year
- Code for Learning to Zoom and Unzoom (CVPR 2023)☆47Updated 2 years ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆82Updated 9 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆19Updated 10 months ago
- [CVPR 2025] Test-Time Visual In-Context Tuning☆25Updated 5 months ago
- ☆12Updated last year
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆58Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆87Updated 5 months ago
- [ICCV 2023]The PyTorch implementation of TL-Align: Token-Label Alignment for Vision Transformers.☆23Updated 2 years ago
- Video Diffusion State Space Models☆19Updated last year
- Official implementation of "Can Language Understand Depth?"☆82Updated 2 years ago
- Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation (NeurIPS 23)☆12Updated 3 months ago
- DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection☆20Updated last year
- Official implementation of "WorDepth: Variational Language Prior for Monocular Depth Estimation"☆41Updated 6 months ago
- Code & Weights for “Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation”☆13Updated 8 months ago