peterant330 / KUEALinks
[ICML'25] Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models
☆14Updated 3 weeks ago
Alternatives and similar repositories for KUEA
Users that are interested in KUEA are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆50Updated last week
- CLIP-MoE: Mixture of Experts for CLIP☆45Updated 10 months ago
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆53Updated last year
- [CVPR 2025] Few-shot Recognition via Stage-Wise Retrieval-Augmented Finetuning☆23Updated 2 months ago
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆57Updated 9 months ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆42Updated 9 months ago
- [NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models☆70Updated 3 months ago
- Official Implementation of DiffCLIP: Differential Attention Meets CLIP☆42Updated 5 months ago
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆24Updated 10 months ago
- Official implementation of TagAlign☆35Updated 8 months ago
- cliptrase☆43Updated last year
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)☆40Updated this week
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"☆30Updated last month
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆82Updated last year
- ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs☆25Updated 2 weeks ago
- ☆17Updated 9 months ago
- Official Implementation of CODE☆15Updated 11 months ago
- ☆56Updated last year
- [CVPR 2025 Highlight] Interpreting Object-level Foundation Models via Visual Precision Search☆46Updated 3 weeks ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆74Updated 2 years ago
- [CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆102Updated 3 months ago
- [ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models☆34Updated last month
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]☆44Updated 5 months ago
- ☆23Updated last year
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆26Updated 11 months ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification☆33Updated 5 months ago
- ☆19Updated 3 months ago
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆96Updated this week
- [ICML2024]The official implementation of SemiRES in PyTorch.☆28Updated last year
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆54Updated 2 years ago