wlin-at/MAXI

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wlin-at/MAXI)

wlin-at / MAXI

MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)

☆31

Alternatives and similar repositories for MAXI

Users that are interested in MAXI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dmoltisanti / air-cvpr23
View on GitHub
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13May 25, 2023Updated 3 years ago
muzairkhattak / ViFi-CLIP
View on GitHub
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
☆309Apr 3, 2024Updated 2 years ago
engindeniz / vitis
View on GitHub
[ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts
☆13Jan 13, 2025Updated last year
wengzejia1 / Open-VCLIP
View on GitHub
☆119Feb 19, 2024Updated 2 years ago
sauradip / STALE
View on GitHub
[ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "
☆116Aug 3, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tomchen-ctj / OST
View on GitHub
【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition
☆39Apr 27, 2024Updated 2 years ago
whwu95 / BIKE
View on GitHub
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
☆156Sep 9, 2024Updated last year
sauradip / MUPPET
View on GitHub
[ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"
☆16Aug 30, 2023Updated 2 years ago
pulkitkumar95 / tats
View on GitHub
☆20Feb 20, 2025Updated last year
AndongDeng / BEAR
View on GitHub
BEAR: a new BEnchmark on video Action Recognition
☆46Apr 21, 2024Updated 2 years ago
OpenGVLab / efficient-video-recognition
View on GitHub
☆184Aug 20, 2022Updated 3 years ago
alibaba-mmai-research / CLIP-FSAR
View on GitHub
Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".
☆82Mar 7, 2024Updated 2 years ago
shvdiwnkozbw / Self-supervised-Video-Concept
View on GitHub
Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.
☆11Jul 28, 2022Updated 3 years ago
jiazheng-xing / SloshNet
View on GitHub
[AAAI2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet)
☆14Jan 10, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
KPeng9510 / Trans4SOAR
View on GitHub
☆14Apr 1, 2023Updated 3 years ago
Shahzadnit / EZ-CLIP
View on GitHub
☆24May 11, 2025Updated last year
ekazakos / MTCN
View on GitHub
Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch
☆20Dec 16, 2021Updated 4 years ago
alibaba-mmai-research / HyRSMPlusPlus
View on GitHub
Code for our paper "HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition".
☆15Jan 3, 2023Updated 3 years ago
Mia-YatingYu / STDD
View on GitHub
[AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP
☆23Aug 5, 2025Updated 11 months ago
jbertrand89 / matching_based_fsar
View on GitHub
☆12Mar 30, 2023Updated 3 years ago
starrycos / PAINet
View on GitHub
[ICCV'23] PAINet: Parallel Attention Interaction Network for Few-shot Skeleton-based Action Recognition
☆11Oct 14, 2023Updated 2 years ago
Chiaraplizz / ARGO1M-What-can-a-cook
View on GitHub
☆11Jul 14, 2023Updated 3 years ago
intel / TVP
View on GitHub
☆15Aug 4, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Sarinda251 / CDFSL-V
View on GitHub
Accepted at ICCV '23
☆16Oct 4, 2023Updated 2 years ago
shiyi-zh0408 / LOGO
View on GitHub
[CVPR 2023] LOGO: A Long-Form Video Dataset for Group Action Quality Assessment
☆48Apr 9, 2024Updated 2 years ago
Dawn-LX / OpenVoc-VidVRD
View on GitHub
Official code for the ICLR2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
☆43Jun 4, 2024Updated 2 years ago
HJYao00 / Side4Video
View on GitHub
☆42Apr 7, 2024Updated 2 years ago
leexinhao / ZeroI2V
View on GitHub
[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
☆20Jul 29, 2024Updated last year
AIM3-RUC / Youmakeup_Challenge2022
View on GitHub
☆17Jun 15, 2022Updated 4 years ago
RyannDaGreat / rp
View on GitHub
This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires python≥3.5
☆13Jul 13, 2026Updated last week
R00Kie-Liu / TA2N
View on GitHub
TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition
☆17Mar 26, 2024Updated 2 years ago
webber2933 / iCLIP
View on GitHub
[ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection
☆21Feb 22, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Droliven / diverse_sampling
View on GitHub
Official project of DiverseSampling (ACMMM2022 Paper)
☆16Feb 25, 2023Updated 3 years ago
franciszzj / OpenPSG
View on GitHub
[ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models
☆51Jan 8, 2025Updated last year
deep-real / DEAL
View on GitHub
The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)
☆20Mar 9, 2026Updated 4 months ago
linziyi96 / st-adapter
View on GitHub
☆87May 8, 2023Updated 3 years ago
ATAboukhadra / THOR-Net
View on GitHub
End-to-end Graformer-based Realistic Two Hands and Object Reconstruction with self-supervision
☆21May 15, 2023Updated 3 years ago
Richard-61 / FineAction
View on GitHub
The official codebase of FineAction dataset. We will update the data and code of our FineAction.
☆24Apr 10, 2025Updated last year
zihuixue / ProgCaptioner
View on GitHub
Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)
☆26Jul 16, 2025Updated last year