Maryeon / whiten_mtdLinks
Official repository of paper "Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval"
☆10Updated last year
Alternatives and similar repositories for whiten_mtd
Users that are interested in whiten_mtd are comparing it to the libraries listed below
Sorting:
- ☆22Updated last year
- Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".☆13Updated 4 months ago
- ☆21Updated last year
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025☆20Updated 3 months ago
- [CVPR2025] Synthetic Data is an Elegant GIFT for Continual Vision-Language Models☆16Updated 2 weeks ago
- [CVPR-2024] NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation☆15Updated 8 months ago
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"☆51Updated 10 months ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆24Updated last year
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆39Updated 3 months ago
- A simple pytorch implementation of baseline based-on CLIP for Image-text Matching.☆14Updated 2 years ago
- [CVPR2025] Official implementation of RAM☆17Updated 3 months ago
- The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" accepted by NeurIPS…☆26Updated last year
- [NeurIPS 2024] Activating Self-Attention for Multi-Scene Absolute Pose Regression☆12Updated 4 months ago
- ☆13Updated 7 months ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆53Updated last month
- GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery (CVPR2025)☆22Updated 3 months ago
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆26Updated 3 months ago
- Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment, arXiv 2024 / CVPR 2025☆30Updated 4 months ago
- [AAAI 2025] The official repository of our paper "GCD: Advancing Vision-Language Models for Incremental Object Detection via Global Align…☆12Updated last month
- Collection of awesome Continual Test-Time Adaptation methods☆18Updated last year
- ☆27Updated 2 years ago
- This is the GitHub repository for Data Augmentation for Saliency Prediction via Latent Diffusion paper in ECCV 2024, Milano, Italy☆13Updated 8 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆73Updated 5 months ago
- (CVPR2024 Highlight) Novel Class Discovery for Ultra-Fine-Grained Visual Categorization (UFG-NCD)☆20Updated last year
- Official Repository for ICML 2024 Paper "OT-CLIP: Understanding and Generalizing CLIP via Optimal Transport"☆16Updated last year
- [ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentangl…☆12Updated 2 months ago
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆18Updated last year
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆25Updated 5 months ago
- [NeurIPS24] VisMin: Visual Minimal-Change Understanding☆15Updated 4 months ago
- code for FineLIP☆26Updated 3 months ago