ltpwy / MSCI
☆18Updated 2 months ago
Alternatives and similar repositories for MSCI
Users that are interested in MSCI are comparing it to the libraries listed below
- Official PyTorch Implementation of ZSLViT (CVPR'24)☆14Updated last year
- Code for Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning☆15Updated last year
- ☆26Updated last year
- ☆23Updated 10 months ago
- The code of "Logits DeConfusion with CLIP for Few-Shot Learning" (CVPR 2025)☆34Updated last month
- [CVPR 2025] Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation☆19Updated last month
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆320Updated 3 weeks ago
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆627Updated last week
- Code repository for "Post-pre-training for Modality Alignment in Vision-Language Foundation Models" (CVPR2025)☆23Updated this week
- ☆16Updated last month
- This is the official implementation for our CVPR2024 paper "Rethinking Prior Information Generation with CLIP for Few-Shot Segmentation".…☆46Updated last year
- [AAAI2024] Official implementation of TGP-T☆28Updated last year
- The official implementation of AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP☆113Updated 2 months ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆44Updated last year
- NeurIPS 2024☆38Updated last month
- Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)☆25Updated 4 months ago
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆40Updated 2 months ago
- Learning Better Video Query with SAM for Video Instance Segmentation (TCSVT 2024)☆23Updated last year
- The official implementation of VLPL: Vision Language Pseudo Label for Multi-label Learning with Single Positive Labels☆16Updated 7 months ago
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition☆14Updated last year
- Official implementation of ResCLIP: Residual Attention for Training-free Dense Vision-language Inference☆41Updated 4 months ago
- A collection of papers, datasets, benchmarks, code, and model weights for Remote Sensing Cross-Modal Image-Text Retrieval (RSCMIT).☆20Updated 5 months ago
- PyTorch implementation for Robust Contrastive Cross-modal Hashing with Noisy Labels. (ACM Multimedia 2024).☆12Updated 9 months ago
- Official code Implementation of "Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP" (AAA…☆14Updated 7 months ago
- [AAAI2025] MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt☆77Updated 2 months ago
- [ICCV 2023] Class-incremental Continual Learning for Instance Segmentation with Image-level Weak Supervision☆10Updated last year
- ☆61Updated 7 months ago
- [ChinaMM2025] Decision-fusion strategy for multimodal object detection without spatial registration☆38Updated last week
- Adaptive FSS (accepted by AAAI 2024): A Novel Few-Shot Segmentation Framework via Prototype Enhancement☆39Updated last year
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆52Updated 3 months ago