Official Implementation of DiffCLIP: Differential Attention Meets CLIP
☆54Mar 12, 2025Updated last year
Alternatives and similar repositories for DiffCLIP
Users that are interested in DiffCLIP are comparing it to the libraries listed below
- The code will come soon.☆15Sep 12, 2025Updated 6 months ago
- [AAAI 2025] Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification☆19Apr 17, 2025Updated 11 months ago
- PyTorch implementation of "Robust Multi-View Clustering with Noisy Correspondence" (TKDE 2024)☆11Aug 2, 2024Updated last year
- The official repository for "Rethinking Multi-view Representation Learning via Distilled Disentangling"☆12Apr 3, 2024Updated last year
- Diffusion-based Missing-view Generation With the Application on Incomplete Multi-view Clustering☆10May 26, 2024Updated last year
- Code of Graph Contrastive Partial Multi-View Clustering☆12Mar 10, 2025Updated last year
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆16Jan 18, 2024Updated 2 years ago
- Official code and data for "Bridging Gaps: Federated Multi-View Clustering in Heterogeneous Hybrid Views" (NeurIPS 2024)☆16Oct 13, 2024Updated last year
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?☆42Nov 1, 2024Updated last year
- [MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"☆14Nov 1, 2024Updated last year
- The official code for "Heterogeneity-Aware Federated Deep Multi-View Clustering towards Diverse Feature Representations" (ACM MM 24)☆18Nov 16, 2024Updated last year
- The official codebase for Reflected Flow Matching (ICML 2024)☆22Jun 19, 2024Updated last year
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology☆12Jun 17, 2025Updated 9 months ago
- The code for "Deep Contrastive Graph Learning with Clustering-Oriented Guidance" (AAAI24).☆19Apr 13, 2024Updated last year
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated last year
- ☆15Jul 24, 2022Updated 3 years ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆34Sep 16, 2023Updated 2 years ago
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Mar 8, 2024Updated 2 years ago
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"☆57Oct 14, 2025Updated 5 months ago
- A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes (WACV 2025)☆12Aug 11, 2025Updated 7 months ago
- Training scripts and Python modules for the ECCV 2018 paper "Interpolating Convolutional Networks Using Batch Normalization"☆11Jun 13, 2020Updated 5 years ago
- [TACL] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representation☆17Feb 8, 2024Updated 2 years ago
- ☆11Oct 20, 2023Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 6 months ago
- A Novel Federated Multi-View Clustering Method for Unaligned and Incomplete Data Fusion☆25Oct 13, 2024Updated last year
- ☆13Jan 22, 2025Updated last year
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Aug 26, 2024Updated last year
- We introduce a new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆21Jan 11, 2026Updated 2 months ago
- ☆35Feb 5, 2024Updated 2 years ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 9 months ago
- Online Hyperparameter Optimization☆11Feb 17, 2021Updated 5 years ago
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues" (ICLR 2023)☆20Aug 24, 2023Updated 2 years ago
- VF-NeRF code. We learn to densely reconstruct indoor from multi-view images by representing the surface with Vector Fields (VF). We devel…☆20Sep 30, 2024Updated last year
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".☆12Oct 11, 2024Updated last year
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆17Jun 3, 2025Updated 9 months ago
- [AAAI 2024] ICMVC: Incomplete Contrastive Multi-View Clustering with High-confidence Guiding☆33Jul 22, 2024Updated last year
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models☆15Nov 1, 2024Updated last year
- [WWW 2025 Oral] ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning☆20Jul 2, 2025Updated 8 months ago