[AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
☆71Dec 23, 2024Updated last year
Alternatives and similar repositories for m2clip
Users who are interested in m2clip are comparing it to the libraries listed below.
- This is the official implementation of the paper "ActionCLIP: A New Paradigm for Action Recognition"☆608Dec 6, 2023Updated 2 years ago
- Placeholder☆10Jul 17, 2023Updated 2 years ago
- ☆42Apr 7, 2024Updated 2 years ago
- ☆119Feb 19, 2024Updated 2 years ago
- Taylor videos and Taylor-transformed skeletons (ICML 2024).☆16Jul 25, 2024Updated last year
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆44Mar 11, 2025Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Sep 25, 2023Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆94Feb 25, 2025Updated last year
- A simple but efficient transformer model for video action recognition☆64Oct 8, 2022Updated 3 years ago
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆126Jul 1, 2023Updated 2 years ago
- ☆17Oct 10, 2023Updated 2 years ago
- [ACMMM 2024] Implementation of the paper “Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition”.☆45Mar 21, 2025Updated last year
- Convolutional Initialization for Data-Efficient Vision Transformers☆15Dec 9, 2025Updated 4 months ago
- PyTorch implementation for "Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning" (ICML 2024)☆26May 11, 2025Updated 11 months ago
- [IEEE TIP] Official implementation for the work "BadCM: Invisible Backdoor Attack against Cross-Modal Learning".☆14Aug 30, 2024Updated last year
- A Cross-Modal RGB-Event Benchmark for Multi-Object Tracking and Detection.☆12Oct 17, 2023Updated 2 years ago
- (AAAI 2024) DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition☆26Apr 15, 2024Updated 2 years ago
- ☆49Nov 12, 2022Updated 3 years ago
- 【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models☆155Sep 9, 2024Updated last year
- Modality-Invariant Temporal Representation Learning☆22Apr 21, 2023Updated 2 years ago
- ☆25Apr 16, 2025Updated last year
- Multimodal Fusion via Teacher-Student Network for Indoor Action Recognition☆25Jul 12, 2022Updated 3 years ago
- [ACM MM2024] The code for HMLLM.☆11Oct 27, 2024Updated last year
- Multimodal sentiment analysis using transformer encoders and fusion across text, audio, and visual features on the CMU-MOSEI dataset usin…☆13Jun 4, 2025Updated 10 months ago
- ☆12Dec 14, 2023Updated 2 years ago
- A series of face anti-spoofing datasets, for the convenience of management and benchmarking.☆16Dec 10, 2024Updated last year
- Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".☆78Mar 7, 2024Updated 2 years ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Oct 20, 2023Updated 2 years ago
- [ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition☆301Sep 17, 2023Updated 2 years ago
- Codes for "UniMF: A Unified Multimodal Framework for Multimodal Sentiment Analysis in Missing Modalities and Unaligned Multimodal Sequenc…☆30Jan 9, 2024Updated 2 years ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆13Jun 26, 2021Updated 4 years ago
- ☆184Aug 20, 2022Updated 3 years ago
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆94Jan 23, 2026Updated 2 months ago
- GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?☆184May 22, 2024Updated last year
- ☆10Dec 17, 2024Updated last year
- [ECAI-2024] OmniCLIP: Adapting CLIP for Video Recognition with Spatial-Temporal Omni-Scale Feature Learning☆16Jan 7, 2025Updated last year
- ☆13Jun 4, 2020Updated 5 years ago
- [ICCV 2023] Rethinking Point Cloud Registration as Masking and Reconstruction☆10Aug 14, 2023Updated 2 years ago