Visual-AI/FROSTER

[ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition

☆101

Alternatives and similar repositories for FROSTER

Users who are interested in FROSTER are comparing it to the repositories listed below.

wengzejia1 / Open-VCLIP
☆119 · Feb 19, 2024 · Updated 2 years ago
Visual-AI / SCD
[CVPRW2024] What’s in a Name? Beyond Class Indices for Image Recognition
☆17 · Aug 30, 2024 · Updated last year
Visual-AI / PruneVid
[ACL 2025] PruneVid: Visual Token Pruning for Efficient Video Large Language Models
☆71 · May 15, 2025 · Updated last year
Mia-YatingYu / STDD
[AAAI'25] Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP
☆23 · Aug 5, 2025 · Updated 11 months ago
alibaba-mmai-research / DiST
ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
☆41 · Sep 25, 2023 · Updated 2 years ago
HJYao00 / Side4Video
☆42 · Apr 7, 2024 · Updated 2 years ago
Visual-AI / JoVA
JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
☆33 · Dec 22, 2025 · Updated 6 months ago
Shahzadnit / EZ-CLIP
☆24 · May 11, 2025 · Updated last year
TalalWasim / Vita-CLIP
Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]
☆126 · Jul 1, 2023 · Updated 3 years ago
whwu95 / BIKE
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
☆156 · Sep 9, 2024 · Updated last year
Visual-AI / 3DRS
[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding
☆158 · Dec 9, 2025 · Updated 7 months ago
Visual-AI / SPTNet
[ICLR2024] SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning
☆36 · Apr 9, 2025 · Updated last year
Visual-AI / HiLo
[ICLR2025] HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts
☆22 · Aug 1, 2025 · Updated 11 months ago
ArielZc / CU-Net
[CVPR 2023] Official code for paper: Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detecti…
☆32 · Jun 23, 2023 · Updated 3 years ago
kangzhiq / NNCSL
[ICCV 2023 Oral] Official PyTorch implementation of our paper for semi-supervised continual learning "A soft nearest-neighbor framework f…
☆25 · Dec 17, 2024 · Updated last year
OliverHxh / SkeletonGCL
[ICLR 2023] Graph Contrastive Learning for Skeleton-based Action Recognition.
☆57 · Jun 4, 2025 · Updated last year
Visual-AI / PromptCCD
[ECCV2024] PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery
☆31 · Apr 3, 2025 · Updated last year
naver-ai / tc-clip
[ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"
☆102 · Feb 25, 2025 · Updated last year
taoyang1122 / adapt-image-models
[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
☆298 · Sep 17, 2023 · Updated 2 years ago
MCG-NJU / ZeroI2V
[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
☆23 · Jul 29, 2024 · Updated last year
whwu95 / Text4Vis
【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
☆199 · May 30, 2024 · Updated 2 years ago
vladan-stojnic / ZLaP
Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)
☆45 · Jul 23, 2024 · Updated last year
TencentARC / TaCA
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16 · Jun 20, 2023 · Updated 3 years ago
dmoltisanti / air-cvpr23
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13 · May 25, 2023 · Updated 3 years ago
wlin-at / ViTTA
Video Test-Time Adaptation for Action Recognition (CVPR 2023)
☆53 · Oct 13, 2024 · Updated last year
lntzm / MESM
The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)
☆32 · Mar 29, 2024 · Updated 2 years ago
DeLightCMU / ElaborativeRehearsal
This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)
☆37 · Apr 9, 2022 · Updated 4 years ago
farewellthree / STAN
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
☆107 · Jan 28, 2024 · Updated 2 years ago
chenghao-ch94 / AGNN
[MIPR 2022 & TMM 2023] "Attentive Graph Neural Networks for Few-shot Learning" with its extension version
☆16 · Apr 17, 2023 · Updated 3 years ago
xuyu0010 / ATCoN
Repository for ECCV 2022 paper "Source-free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition"
☆24 · Mar 9, 2023 · Updated 3 years ago
ctX-u / PLOVAD
Source code of our paper in TCSVT 2025: PLOVAD: Prompting Vision-Language Models for Open Vocabulary Video Anomaly Detection
☆33 · Feb 15, 2025 · Updated last year
Visual-AI / RegionDrag
[ECCV2024] RegionDrag: Fast Region-Based Image Editing with Diffusion Models
☆67 · Oct 9, 2024 · Updated last year
OpenGVLab / efficient-video-recognition
☆184 · Aug 20, 2022 · Updated 3 years ago
ninatu / in_style
Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023
☆11 · Oct 5, 2023 · Updated 2 years ago
sming256 / AdaTAD
[CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
☆42 · Jul 9, 2024 · Updated 2 years ago
muzairkhattak / ViFi-CLIP
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
☆309 · Apr 3, 2024 · Updated 2 years ago
leexinhao / ZeroI2V
[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
☆20 · Jul 29, 2024 · Updated last year
Charles-Xie / awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…
☆356 · Nov 6, 2025 · Updated 8 months ago
yeliudev / R2-Tuning
🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)
☆92 · Jul 2, 2024 · Updated 2 years ago