lzcemma / LeMDA
Code Example for Learning Multimodal Data Augmentation in Feature Space
☆42Updated 2 years ago
Alternatives and similar repositories for LeMDA:
Users that are interested in LeMDA are comparing it to the libraries listed below
- Official Implementation of "Geometric Multimodal Contrastive Representation Learning" (https://arxiv.org/abs/2202.03390)☆28Updated 2 months ago
- CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021☆62Updated 3 years ago
- code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720☆56Updated 9 months ago
- Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”☆48Updated 2 years ago
- ☆26Updated 3 years ago
- MixGen: A New Multi-Modal Data Augmentation☆122Updated 2 years ago
- [ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning☆38Updated last year
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)☆57Updated 10 months ago
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆33Updated 2 years ago
- Multi-label Image Recognition with Partial Labels (IJCV'24, ESWA'24, AAAI'22)☆38Updated 8 months ago
- ☆23Updated 2 years ago
- CVPR 2022, Robust Contrastive Learning against Noisy Views☆83Updated 3 years ago
- ☆57Updated last year
- The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral).☆18Updated 3 years ago
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)☆36Updated 8 months ago
- offical implementation of "Calibrating Multimodal Learning" on ICML 2023☆20Updated last year
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training☆24Updated last year
- [AAAI 2023] Contrastive Masked Autoencoders for Self-Supervised Video Hashing☆27Updated last year
- Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning☆25Updated last year
- ☆68Updated last year
- This repo is the official implementation of UPL (Unsupervised Prompt Learning for Vision-Language Models).☆114Updated 2 years ago
- Compress conventional Vision-Language Pre-training data☆49Updated last year
- [ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval☆30Updated last year
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆35Updated last year
- Official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval". CVPR 2022☆102Updated 2 years ago
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)☆52Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆38Updated last year
- Code for: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification☆27Updated last year
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)☆48Updated last year
- Vision-Language Pretraining & Efficient Transformer Papers.☆14Updated 3 years ago