gorjanradevski / multimodal-distillation
Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023)
☆21Updated 7 months ago
Related projects: ⓘ
- [AAAI 2023] Official pytorch implementation of "Towards Good Practices for Missing Modality Robust Action Recognition"☆16Updated last year
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆25Updated 2 months ago
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"☆43Updated 2 months ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆26Updated last year
- Official PyTorch implementation of "Masked Images Are Counterfactual Samples for Robust Fine-tuning" (CVPR 2023)☆12Updated last year
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆71Updated 7 months ago
- Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023)☆21Updated 11 months ago
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆10Updated 7 months ago
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024☆11Updated 2 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆66Updated last year
- ☆45Updated last year
- MaskCon: Masked Contrastive Learning for Coarse-Labeled Dataset (CVPR2023)☆31Updated 7 months ago
- ☆48Updated 9 months ago
- Official repository of "Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer", AAAI 2024☆13Updated 5 months ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆41Updated last year
- ☆67Updated last year
- Official repository of ”Mamba-FSCIL: Dynamic Adaptation with Selective State Space Model for Few-Shot Class-Incremental Learning"☆18Updated 3 weeks ago
- [NeurIPS 2023] Meta-Adapter☆35Updated 10 months ago
- ☆37Updated 8 months ago
- [CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models.☆51Updated 2 months ago
- [CVPR'23] Official PyTorch implementation of Distilling Self-Supervised Vision Transformers for Weakly-Supervised Few-Shot Classification…☆36Updated 10 months ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆33Updated 8 months ago
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆51Updated last month
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆34Updated last year
- ☆21Updated last year
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆137Updated 9 months ago
- ProbVLM: Probabilistic Adapter for Frozen Vision-Language Models☆27Updated 9 months ago
- ☆27Updated this week
- ☆25Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆36Updated 9 months ago