gorjanradevski / multimodal-distillationLinks
Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023)
☆25Updated last year
Alternatives and similar repositories for multimodal-distillation
Users that are interested in multimodal-distillation are comparing it to the libraries listed below
Sorting:
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆17Updated last year
- ☆79Updated 2 years ago
- [AAAI 2023 Oral] Official pytorch implementation of "Towards Good Practices for Missing Modality Robust Action Recognition"☆21Updated 2 years ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆28Updated 2 years ago
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆38Updated 2 years ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆79Updated last year
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆46Updated 7 months ago
- Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.☆32Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆49Updated last year
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆37Updated last year
- Code for dmrnet☆24Updated 2 weeks ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆81Updated last year
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆79Updated 4 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆42Updated 11 months ago
- ☆36Updated 2 years ago
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆118Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆76Updated 2 years ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Updated last year
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆66Updated 3 months ago
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"☆58Updated 3 months ago
- Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition☆31Updated 2 years ago
- Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”☆12Updated last year
- ☆27Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆12Updated 6 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆74Updated 4 months ago
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆53Updated last year
- Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"☆41Updated 7 months ago
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""☆14Updated 11 months ago
- ☆61Updated last year
- ☆23Updated last year