SamsungLabs / AdaCLIP
This repository contains the code for AdaCLIP, a computation and latency-aware system for pragmatic multimodal video retrieval.
☆10Updated 9 months ago
Alternatives and similar repositories for AdaCLIP:
Users that are interested in AdaCLIP are comparing it to the libraries listed below
- ☆16Updated 4 months ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆33Updated 10 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆15Updated 5 months ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆16Updated last month
- [CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention☆9Updated this week
- [CVPR 2023] Pytorch Code of MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering☆16Updated last year
- Official Repository of Personalized Visual Instruct Tuning☆27Updated last week
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆40Updated last month
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆34Updated 9 months ago
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation☆29Updated last month
- [ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation"☆31Updated 7 months ago
- Official implementation for CIGN☆15Updated last year
- Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension☆22Updated 4 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆38Updated last year
- Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"☆21Updated 3 months ago
- [ICCV 2023] HybridAugment++: Unified Frequency Spectra Perturbations for Model Robustness☆17Updated last year
- [CVPR 2024 Highlight] ImageNet-D☆41Updated 5 months ago
- ☆27Updated last year
- ☆37Updated 7 months ago
- ☆21Updated 8 months ago
- ☆16Updated last year
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆38Updated 2 months ago
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆18Updated 3 months ago
- Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"☆24Updated last month
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆49Updated 2 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆44Updated 3 months ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆35Updated 2 weeks ago
- ☆14Updated 5 months ago