young-geng / m3ae_public
Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation
☆100 · Updated 7 months ago
Related projects
Alternatives and complementary repositories for m3ae_public
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning ☆130 · Updated 2 years ago
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models ☆146 · Updated 11 months ago
- ☆63 · Updated 2 years ago
- ☆61 · Updated last year
- [CVPR 2023] Learning Visual Representations via Language-Guided Sampling ☆145 · Updated last year
- [TMLR 2022] High-Modality Multimodal Transformer ☆107 · Updated 2 weeks ago
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604… ☆79 · Updated 2 years ago
- [CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"