SamsungLabs / AdaCLIP

This repository contains the code for AdaCLIP, a computation- and latency-aware system for pragmatic multimodal video retrieval.

☆10

Alternatives and similar repositories for AdaCLIP

Users who are interested in AdaCLIP are comparing it to the repositories listed below.

KHU-VLL / DEVIAS
[ECCV 2024 Oral] Official implementation of the paper "DEVIAS: Learning Disentangled Video Representations of Action and Scene"
☆19 Updated 7 months ago
KAIST-Visual-AI-Group / APC-VLM
Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
☆26 Updated 3 weeks ago
ytaek-oh / fsc-clip
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
☆16 Updated 7 months ago
yliu-cs / PiTe
[ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model
☆16 Updated 3 months ago
sterzhang / PVIT
Official Repository of Personalized Visual Instruct Tuning
☆28 Updated 2 months ago
slowfast-vgen / slowfast-vgen
☆22 Updated 6 months ago
lxa9867 / r2bench
[ECCV 2024] R2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations
☆10 Updated 9 months ago
UCSC-VLAA / MixCon3D
[CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"
☆35 Updated last year
ruili33 / TPO
☆32 Updated 3 months ago
Adamdad / Repfusion
☆54 Updated last year
hammoudhasan / DiffCLIP
Official Implementation of DiffCLIP: Differential Attention Meets CLIP
☆26 Updated 2 months ago
tian1327 / SWAT
[CVPR 2025] Few-shot Recognition via Stage-Wise Retrieval-Augmented Finetuning
☆17 Updated last month
zhiheLu / Ensemble_VLM
[ICML 2024] Official code for the paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models"
☆24 Updated 3 months ago
LeapLabTHU / EchoWorld
[CVPR 2025] EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
☆15 Updated last month
Vision-CAIR / InfiniBench
☆14 Updated 7 months ago
lxa9867 / PaintSeg
[NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation"
☆14 Updated last year
MCR-PEFT / Ex-MCR
☆43 Updated 3 weeks ago
MKYucel / hybrid_augment
[ICCV 2023] HybridAugment++: Unified Frequency Spectra Perturbations for Model Robustness
☆17 Updated last year
ltttpku / CMMP
☆19 Updated 6 months ago
SEU-VIPGroup / Understanding_Vision_Tasks
☆12 Updated 3 months ago
cfeng16 / GPS2Pix
[CVPR 2025] GPS as a Control Signal for Image Generation
☆18 Updated 2 months ago
Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking
[CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images.
☆28 Updated last year
qhfan / RALA
[CVPR 2025] Breaking the Low-Rank Dilemma of Linear Attention
☆20 Updated 2 months ago
deepglint / Croc
Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension
☆23 Updated 6 months ago
tripletclip / TripletCLIP
[NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"
☆39 Updated 5 months ago
HaroldChen19 / VistaDPO
[ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
☆24 Updated 2 weeks ago
jaehong31 / SAFREE
[ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation
☆38 Updated 3 months ago
alhojel / visual_task_vectors
☆37 Updated 10 months ago
adobe-research / llava-score
☆11 Updated 7 months ago
wjpoom / SPEC
[CVPR 2024] The official implementation of the paper "Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding"
☆43 Updated 2 months ago