YeonwooSung / LIMoE-pytorch
PyTorch implementation of LIMoE
☆53 · Updated last year
Alternatives and similar repositories for LIMoE-pytorch:
Users who are interested in LIMoE-pytorch are comparing it to the repositories listed below.
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning ☆153 · Updated 2 years ago
- A PyTorch implementation of Multimodal Few-Shot Learning with Frozen Language Models with OPT. ☆44 · Updated 2 years ago
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training ☆24 · Updated last year
- code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720 ☆56 · Updated 10 months ago
- This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat… ☆78 · Updated 2 months ago
- ☆58 · Updated last year
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts" ☆53 · Updated last year
- Official implementation for CVPR'23 paper "BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning" ☆111 · Updated last year
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning" ☆81 · Updated last year
- Official implementation for NeurIPS'23 paper "Geodesic Multi-Modal Mixup for Robust Fine-Tuning" ☆33 · Updated 7 months ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs" ☆73 · Updated 5 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023) ☆38 · Updated last year
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning ☆82 · Updated 11 months ago
- Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation ☆103 · Updated last month
- Official repository for the A-OKVQA dataset ☆84 · Updated 11 months ago
- [ICLR 2023] Official code repository for "Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning" ☆59 · Updated last year
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models ☆160 · Updated last year
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022) ☆205 · Updated 2 years ago
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023. ☆32 · Updated last year
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆72 · Updated 5 months ago
- NegCLIP. ☆31 · Updated 2 years ago
- Language Quantized AutoEncoders ☆103 · Updated 2 years ago
- Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution ☆48 · Updated last year
- Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023) ☆56 · Updated last year
- Compress conventional Vision-Language Pre-training data ☆49 · Updated last year
- S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions ☆48 · Updated last year
- Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models ☆91 · Updated last year
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024) ☆58 · Updated 10 months ago
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift ☆36 · Updated last year
- The Continual Learning in Multimodality Benchmark ☆67 · Updated last year