[AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
☆70Dec 23, 2024Updated last year
Alternatives and similar repositories for m2clip
Users that are interested in m2clip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"☆610Dec 6, 2023Updated 2 years ago
- Placeholder☆10Jul 17, 2023Updated 2 years ago
- ☆42Apr 7, 2024Updated 2 years ago
- ☆119Feb 19, 2024Updated 2 years ago
- Taylor videos and Taylor-transformed skeletons (ICML 2024).☆16Jul 25, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆44Mar 11, 2025Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Sep 25, 2023Updated 2 years ago
- A simple but efficient transformer model for video action recognition☆64Oct 8, 2022Updated 3 years ago
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆126Jul 1, 2023Updated 2 years ago
- [ACMMM 2024] Implementation of the paper “Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition“.☆47Mar 21, 2025Updated last year
- Convolutional Initialization for Data-Efficient Vision Transformers☆15Dec 9, 2025Updated 5 months ago
- Pytorch implementation for "Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning" (ICML 2024)☆26May 11, 2025Updated last year
- (AAAI 2024) DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition☆26Apr 15, 2024Updated 2 years ago
- ☆49Nov 12, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Modality-Invariant Temporal Representation Learning☆22Apr 21, 2023Updated 3 years ago
- ☆25Apr 16, 2025Updated last year
- Multimodal Fusion via Teacher-Student Network for Indoor Action Recognition☆25Jul 12, 2022Updated 3 years ago
- 生成中文文字识别(OCR)的训练数据☆12Mar 2, 2020Updated 6 years ago
- [ACM MM2024] The code for HMLLM.☆11Oct 27, 2024Updated last year
- ☆12Dec 14, 2023Updated 2 years ago
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated last year
- Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".☆80Mar 7, 2024Updated 2 years ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Oct 20, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A series of face anti-spoofing datasets, for the convenience of management and benchmarking.☆17May 12, 2026Updated last week
- [ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition☆300Sep 17, 2023Updated 2 years ago
- Codes for "UniMF: A Unified Multimodal Framework for Multimodal Sentiment Analysis in Missing Modalities and Unaligned Multimodal Sequenc…☆30Jan 9, 2024Updated 2 years ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- 生成为所欲为动图,灵感来自于sorry项目☆11Mar 28, 2020Updated 6 years ago
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆101Jan 14, 2025Updated last year
- ☆10Dec 17, 2024Updated last year
- ☆15May 12, 2025Updated last year
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆100Jan 23, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- The official implementation of the paper "Asymmetric Polynomial Loss for Multi-Label Classification"(ICASSP 2023)☆20Apr 5, 2023Updated 3 years ago
- [AAAI 2024] V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models☆28Dec 14, 2023Updated 2 years ago
- ☆29Oct 1, 2025Updated 7 months ago
- [ECCV 2022] Code for the paper, ReAct: Temporal Action Detection with Relational Queries☆39Oct 19, 2022Updated 3 years ago
- A Pytorch implementation of ICML 2023 paper "NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic Segmentation"☆36Dec 2, 2023Updated 2 years ago
- ☆86May 8, 2023Updated 3 years ago