gorjanradevski / multimodal-distillation
Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023)
☆25Updated last year
Alternatives and similar repositories for multimodal-distillation
Users that are interested in multimodal-distillation are comparing it to the libraries listed below
Sorting:
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆17Updated last year
- ☆79Updated 2 years ago
- [AAAI 2023 Oral] Official pytorch implementation of "Towards Good Practices for Missing Modality Robust Action Recognition"☆21Updated 2 years ago
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆38Updated 2 years ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆79Updated last year
- LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections (NeurIPS 2023)☆29Updated last year
- The official implementation of CMAE https://arxiv.org/abs/2207.13532 and https://ieeexplore.ieee.org/document/10330745☆102Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆60Updated last year
- ☆27Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆74Updated 2 years ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆81Updated last year
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆78Updated 9 months ago
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"☆58Updated 2 months ago
- Code for dmrnet☆23Updated 2 months ago
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆42Updated 3 weeks ago
- Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition☆31Updated 2 years ago
- Code accompanying PDiscoNet: Semantically consistent part discovery for fine-grained recognition☆13Updated last year
- Official Repository for CVPR 2024 Paper: "Large Language Models are Good Prompt Learners for Low-Shot Image Classification"☆36Updated 10 months ago
- [ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling☆72Updated last year
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""☆13Updated 10 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆71Updated 3 months ago
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".☆35Updated 4 months ago
- ☆61Updated last year
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆79Updated 4 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆42Updated 10 months ago
- ☆27Updated 2 years ago
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆66Updated 2 months ago
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆53Updated last year
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"☆29Updated 7 months ago
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆12Updated 5 months ago