gorjanradevski / multimodal-distillation
Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023)
☆23 · Updated last year
Alternatives and similar repositories for multimodal-distillation:
Users who are interested in multimodal-distillation are comparing it to the libraries listed below.
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction" ☆15 · Updated last year
- ☆73 · Updated last year
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition ☆27 · Updated last year
- [AAAI 2023] Official pytorch implementation of "Towards Good Practices for Missing Modality Robust Action Recognition" ☆17 · Updated 2 years ago
- ☆47 · Updated 2 years ago
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders" ☆49 · Updated 7 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition" ☆69 · Updated last month
- PMR: Prototypical Modal Rebalance for Multimodal Learning ☆35 · Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆61 · Updated 9 months ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders ☆74 · Updated last year
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models. ☆67 · Updated 6 months ago
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023 ☆52 · Updated last year
- Video Test-Time Adaptation for Action Recognition (CVPR 2023) ☆39 · Updated 4 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024 ☆38 · Updated 7 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training ☆70 · Updated last year
- [CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images. ☆26 · Updated 9 months ago
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022] ☆37 · Updated 11 months ago
- Code for Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos ☆9 · Updated 5 months ago
- [ECCV 2024 oral] C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition ☆30 · Updated 2 months ago
- ☆32 · Updated last year
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024 ☆11 · Updated 3 months ago
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners" ☆13 · Updated 2 months ago
- A curated list of awesome self-supervised learning methods in videos ☆127 · Updated last month
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024) ☆67 · Updated last week
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning" ☆80 · Updated 10 months ago
- ☆20 · Updated 2 years ago
- ☆15 · Updated 11 months ago
- [CVPR 2024] - Official code for the paper "Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation" ☆33 · Updated 5 months ago
- Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations" ☆36 · Updated 3 months ago
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024 ☆50 · Updated 5 months ago