sallymmx/ActionCLIP

This is the official implementation of the paper "ActionCLIP: A New Paradigm for Action Recognition"

☆614

Alternatives and similar repositories for ActionCLIP

Users that are interested in ActionCLIP are comparing it to the libraries listed below.

ju-chen / Efficient-Prompt
☆197 · Oct 22, 2022 · Updated 3 years ago
ArrowLuo / CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
☆1,029 · Apr 12, 2024 · Updated 2 years ago
microsoft / VideoX
VideoX: a collection of video cross-modal models
☆1,072 · Jun 3, 2024 · Updated 2 years ago
OpenGVLab / efficient-video-recognition
☆184 · Aug 20, 2022 · Updated 3 years ago
ttlmh / Bridge-Prompt
[CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
☆102 · Oct 30, 2022 · Updated 3 years ago
open-mmlab / mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
☆5,099 · Mar 18, 2026 · Updated 4 months ago
sauradip / STALE
[ECCV 2022] Official PyTorch implementation of the paper "Zero-Shot Temporal Action Detection via Vision-Language Prompting"
☆116 · Aug 3, 2023 · Updated 2 years ago
muzairkhattak / ViFi-CLIP
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
☆309 · Apr 3, 2024 · Updated 2 years ago
whwu95 / BIKE
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
☆156 · Sep 9, 2024 · Updated last year
alibaba-mmai-research / CLIP-FSAR
Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".
☆82 · Mar 7, 2024 · Updated 2 years ago
sallymmx / m2clip
[AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
☆70 · Dec 23, 2024 · Updated last year
SwinTransformer / Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
☆1,667 · Mar 8, 2023 · Updated 3 years ago
CryhanFang / CLIP2Video
☆260 · Dec 10, 2022 · Updated 3 years ago
MCG-NJU / VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,775 · Dec 8, 2023 · Updated 2 years ago
happyharrycn / actionformer_release
Code release for ActionFormer (ECCV 2022)
☆570 · Apr 11, 2024 · Updated 2 years ago
MartinXM / GAP
Official implementation for "Language Supervised Training for Skeleton-based Action Recognition"
☆131 · Sep 6, 2023 · Updated 2 years ago
m-bain / frozen-in-time
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
☆377 · May 19, 2022 · Updated 4 years ago
webber2933 / iCLIP
[ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection
☆21 · Feb 22, 2024 · Updated 2 years ago
KaiyangZhou / CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
☆2,217 · May 20, 2024 · Updated 2 years ago
raoyongming / DenseCLIP
[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
☆550 · Sep 15, 2023 · Updated 2 years ago
facebookresearch / SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,392 · Mar 16, 2026 · Updated 4 months ago
jayleicn / ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730 · Aug 8, 2023 · Updated 2 years ago
Alvin-Zeng / Awesome-Temporal-Action-Localization
A curated list of resources on temporal action localization/detection and related areas (e.g. temporal action proposal).
☆588 · Sep 22, 2022 · Updated 3 years ago
facebookresearch / TimeSformer
The official PyTorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
☆1,863 · Apr 9, 2024 · Updated 2 years ago
OpenGVLab / InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
☆2,339 · Jul 2, 2026 · Updated 2 weeks ago
mit-han-lab / temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
☆2,215 · Jul 11, 2024 · Updated 2 years ago
facebookresearch / Motionformer
Code + pre-trained models for the paper "Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers"
☆234 · Jun 13, 2022 · Updated 4 years ago
whwu95 / Text4Vis
【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
☆199 · May 30, 2024 · Updated 2 years ago
MCG-NJU / RTD-Action
[ICCV 2021] Relaxed Transformer Decoders for Direct Action Proposal Generation
☆92 · Apr 5, 2022 · Updated 4 years ago
xuguohai / X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
☆185 · Apr 6, 2024 · Updated 2 years ago
MCG-NJU / TDN
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
☆384 · Sep 17, 2022 · Updated 3 years ago
salesforce / ALPRO
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
☆188 · May 1, 2025 · Updated last year
showlab / all-in-one
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
☆281 · Mar 25, 2023 · Updated 3 years ago
tobyperrett / trx
Temporal-Relational CrossTransformers (CVPR 2021)
☆112 · Mar 31, 2021 · Updated 5 years ago
showlab / EgoVLP
[NeurIPS 2022] Egocentric Video-Language Pretraining
☆261 · May 9, 2024 · Updated 2 years ago
kennymckormick / pyskl
A toolbox for skeleton-based action recognition.
☆1,252 · Feb 19, 2026 · Updated 5 months ago
dingfengshi / TriDet
[CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling
☆219 · Dec 27, 2023 · Updated 2 years ago
cvdfoundation / kinetics-dataset
☆981 · May 15, 2024 · Updated 2 years ago
zhenyingfang / Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation
☆591 · Jul 15, 2026 · Updated last week