SamsungLabs / AdaCLIP
This repository contains the code for AdaCLIP, a computation and latency-aware system for pragmatic multimodal video retrieval.
☆10Updated 10 months ago
Alternatives and similar repositories for AdaCLIP:
Users that are interested in AdaCLIP are comparing it to the libraries listed below
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆35Updated 11 months ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆16Updated last month
- The official code for MedAgent_Pro☆13Updated this week
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆16Updated 6 months ago
- ☆17Updated 5 months ago
- ☆19Updated last week
- [CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.☆27Updated 10 months ago
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆41Updated 2 months ago
- WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs☆20Updated last month
- [ECCV 2024] R2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations☆10Updated 8 months ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆19Updated 2 weeks ago
- Official Repository of Personalized Visual Instruct Tuning☆28Updated last month
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation☆32Updated 2 months ago
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆32Updated 2 months ago
- [CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention☆12Updated last month
- Official Implementation of DiffCLIP: Differential Attention Meets CLIP☆24Updated last month
- [CVPR 2023] Pytorch Code of MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering☆16Updated last year
- ☆54Updated last year
- ☆10Updated 10 months ago
- ☆23Updated 2 years ago
- ☆19Updated 5 months ago
- [ICCV 2023] HybridAugment++: Unified Frequency Spectra Perturbations for Model Robustness☆17Updated last year
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆37Updated 4 months ago
- ☆28Updated last year
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆19Updated 4 months ago
- ☆11Updated 6 months ago
- ☆14Updated 6 months ago
- [CVPRW] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"☆24Updated last week
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆38Updated 3 months ago
- A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".☆12Updated 2 years ago