muzairkhattak / ViFi-CLIP

[CVPR 2023] Official repository of the paper "Fine-tuned CLIP models are efficient video learners".

☆285

Alternatives and similar repositories for ViFi-CLIP

Users interested in ViFi-CLIP are comparing it to the repositories listed below.

OpenGVLab / unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
☆336 Updated last year
xuguohai / X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
☆168 Updated last year
TalalWasim / Vita-CLIP
Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]
☆121 Updated 2 years ago
OpenGVLab / efficient-video-recognition
☆176 Updated 2 years ago
wjun0830 / CGDETR
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…
☆134 Updated 11 months ago
miccunifi / SEARLE
[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion
☆183 Updated this week
ruiwang2021 / mvd
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…
☆133 Updated 2 years ago
sming256 / AdaTAD
[CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
☆36 Updated last year
dhg-wei / DeCap
ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning
☆136 Updated 2 years ago
ju-chen / Efficient-Prompt
☆193 Updated 2 years ago
sudo-Boris / mr-Blip
Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"
☆89 Updated 4 months ago
TalalWasim / Video-FocalNets
Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]
☆101 Updated last year
mbzuai-oryx / Video-LLaVA
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
☆257 Updated last year
wengzejia1 / Open-VCLIP
☆117 Updated last year
Ziyang412 / UCoFiA
Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
☆65 Updated last year
LijieFan / LaCLIP
[NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"
☆283 Updated last year
wjun0830 / QD-DETR
Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 …
☆234 Updated last year
huangb23 / VTimeLLM
[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".
☆282 Updated last year
daniel-code / TubeViT
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
☆92 Updated 10 months ago
fmu2 / snag_release
Official Implementation of SnAG (CVPR 2024)
☆51 Updated 3 months ago
Malitha123 / awesome-video-self-supervised-learning
A curated list of awesome self-supervised learning methods in videos
☆149 Updated 3 weeks ago
Yui010206 / SeViLA
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
☆187 Updated last year
NeeluMadan / ViFM_Survey
Foundation Models for Video Understanding: A Survey
☆128 Updated 3 weeks ago
jayleicn / moment_detr
[NeurIPS 2021] Moment-DETR code and QVHighlights dataset
☆321 Updated last year
Visual-AI / FROSTER
The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"
☆84 Updated 6 months ago
naver-ai / tc-clip
[ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"
☆69 Updated 5 months ago
j-min / HiREST
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
☆102 Updated 6 months ago
antoyang / TubeDETR
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
☆183 Updated last year
gyxxyg / VTG-LLM
[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
☆107 Updated 7 months ago
antoyang / VidChapters
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
☆192 Updated last year