taoyang1122/adapt-image-models

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/taoyang1122/adapt-image-models)

taoyang1122 / adapt-image-models

[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition

☆299

Alternatives and similar repositories for adapt-image-models

Users that are interested in adapt-image-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

linziyi96 / st-adapter
View on GitHub
☆87May 8, 2023Updated 3 years ago
alibaba-mmai-research / DiST
View on GitHub
ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
☆41Sep 25, 2023Updated 2 years ago
whwu95 / BIKE
View on GitHub
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
☆156Sep 9, 2024Updated last year
leexinhao / ZeroI2V
View on GitHub
[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
☆20Jul 29, 2024Updated last year
OpenGVLab / efficient-video-recognition
View on GitHub
☆184Aug 20, 2022Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
TalalWasim / Vita-CLIP
View on GitHub
Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]
☆126Jul 1, 2023Updated 3 years ago
ju-chen / Efficient-Prompt
View on GitHub
☆197Oct 22, 2022Updated 3 years ago
HJYao00 / Side4Video
View on GitHub
☆42Apr 7, 2024Updated 2 years ago
muzairkhattak / ViFi-CLIP
View on GitHub
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
☆309Apr 3, 2024Updated 2 years ago
wengzejia1 / Open-VCLIP
View on GitHub
☆119Feb 19, 2024Updated 2 years ago
KMnP / vpt
View on GitHub
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
☆1,240Sep 2, 2023Updated 2 years ago
alibaba-mmai-research / CLIP-FSAR
View on GitHub
Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".
☆82Mar 7, 2024Updated 2 years ago
OpenGVLab / unmasked_teacher
View on GitHub
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
☆348May 27, 2024Updated 2 years ago
Visual-AI / FROSTER
View on GitHub
[ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition
☆101Jan 14, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sallymmx / ActionCLIP
View on GitHub
This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"
☆614Dec 6, 2023Updated 2 years ago
OpenGVLab / VideoMAEv2
View on GitHub
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
☆804Oct 8, 2024Updated last year
whwu95 / Text4Vis
View on GitHub
【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
☆199May 30, 2024Updated 2 years ago
whwu95 / ATM
View on GitHub
【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling?
☆74Jan 26, 2024Updated 2 years ago
wlin-at / MAXI
View on GitHub
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)
☆31Sep 5, 2023Updated 2 years ago
klauscc / VindLU
View on GitHub
☆108Dec 23, 2022Updated 3 years ago
MCG-NJU / MGSampler
View on GitHub
[ICCV 2021] MGSampler: An Explainable Sampling Strategy for Video Action Recognition
☆53Jul 9, 2022Updated 4 years ago
JieShibo / PETL-ViT
View on GitHub
[ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass
☆202Aug 1, 2023Updated 2 years ago
KunyuLin / XOV-Action
View on GitHub
The first work for cross-domain open-vocabulary action recognition with a benchmark
☆21Jul 9, 2026Updated 2 weeks ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
MCG-NJU / VideoMAE
View on GitHub
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,775Dec 8, 2023Updated 2 years ago
park-jungin / DualPath
View on GitHub
☆49Nov 12, 2022Updated 3 years ago
farewellthree / STAN
View on GitHub
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
☆107Jan 28, 2024Updated 2 years ago
FuchenUSTC / DTF
View on GitHub
☆16Aug 5, 2022Updated 3 years ago
ShoufaChen / AdaptFormer
View on GitHub
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
☆388Sep 16, 2022Updated 3 years ago
Francis-Rings / ILA
View on GitHub
[ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition
☆41Nov 29, 2023Updated 2 years ago
zhengzangw / DoPrompt
View on GitHub
Official implementation of PCS in essay "Prompt Vision Transformer for Domain Generalization"
☆48Jan 29, 2023Updated 3 years ago
SwinTransformer / Video-Swin-Transformer
View on GitHub
This is an official implementation for "Video Swin Transformers".
☆1,667Mar 8, 2023Updated 3 years ago
KaiyangZhou / CoOp
View on GitHub
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
☆2,218May 20, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
DeLightCMU / ElaborativeRehearsal
View on GitHub
This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)
☆37Apr 9, 2022Updated 4 years ago
ruiwang2021 / mvd
View on GitHub
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…
☆135May 21, 2023Updated 3 years ago
alibaba-mmai-research / TAdaConv
View on GitHub
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, vi…
☆246Aug 23, 2023Updated 2 years ago
shvdiwnkozbw / Self-supervised-Video-Concept
View on GitHub
Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.
☆11Jul 28, 2022Updated 3 years ago
RERV / UniAdapter
View on GitHub
[ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …
☆77Jan 27, 2024Updated 2 years ago
sauradip / STALE
View on GitHub
[ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "
☆116Aug 3, 2023Updated 2 years ago
microsoft / XPretrain
View on GitHub
Multi-modality pre-training
☆511Mar 27, 2026Updated 3 months ago