pliang279 / MultiViz

[ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models

☆98

Alternatives and similar repositories for MultiViz

Users who are interested in MultiViz are comparing it to the repositories listed below.

pliang279 / FactorCL
[NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy
☆73 · Updated 2 years ago
pliang279 / PID
[NeurIPS 2023, ICMI 2023] Quantifying & Modeling Multimodal Interactions
☆84 · Updated last year
miguelsvasco / gmc
Official Implementation of "Geometric Multimodal Contrastive Representation Learning" (https://arxiv.org/abs/2202.03390)
☆28 · Updated 10 months ago
pliang279 / HighMMT
[TMLR 2022] High-Modality Multimodal Transformer
☆117 · Updated last year
Weixin-Liang / Modality-Gap
Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
☆165 · Updated 3 years ago
nyukat / greedy_multimodal_learning
Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks
☆30 · Updated 3 years ago
marslanm / Multimodality-Representation-Learning
This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…
☆81 · Updated 5 months ago
changdaeoh / multimodal-mixup
Official implementation for NeurIPS'23 paper "Geodesic Multi-Modal Mixup for Robust Fine-Tuning"
☆35 · Updated last year
facebookresearch / reliable_vqa
Implementation for the paper "Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly" (ECCV 2022: https://arxiv.org/abs…
☆37 · Updated 2 years ago
IntelLabs / VL-InterpreT
Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers
☆97 · Updated 2 years ago
salesforce / hierarchicalContrastiveLearning
☆162 · Updated 5 months ago
chingyaoc / RINCE
CVPR 2022, Robust Contrastive Learning against Noisy Views
☆84 · Updated 3 years ago
young-geng / m3ae_public
Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation
☆103 · Updated 8 months ago
Trustworthy-ML-Lab / Label-free-CBM
[ICLR 23] A new framework to transform any neural network into an interpretable concept-bottleneck-model (CBM) without needing labeled c…
☆120 · Updated last year
ys-zong / awesome-self-supervised-multimodal-learning
[T-PAMI] A curated list of self-supervised multimodal learning resources.
☆268 · Updated last year
Heidelberg-NLP / MM-SHAP
This is the official implementation of the paper "MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision…
☆31 · Updated last year
mertyg / post-hoc-cbm
Code for the paper "Post-hoc Concept Bottleneck Models". Spotlight @ ICLR 2023
☆86 · Updated last year
fawazsammani / nlxgpt
NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)
☆48 · Updated last year
pliang279 / HEMM
Holistic evaluation of multimodal foundation models
☆47 · Updated last year
zhjohnchan / awesome-vision-and-language-pretraining
A curated list of vision-and-language pre-training (VLP). :-)
☆59 · Updated 3 years ago
StanfordMIMI / villa
[ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data
☆46 · Updated 2 years ago
IBM / model-reprogramming
Repository for research works and resources related to model reprogramming <https://arxiv.org/abs/2202.10629>
☆63 · Updated last month
YeonwooSung / LIMoE-pytorch
PyTorch implementation of LIMoE
☆52 · Updated last year
alextamkin / dabs
A Domain-Agnostic Benchmark for Self-Supervised Learning
☆106 · Updated 2 years ago
PKU-ML / CLIP-Help-SimCLR
Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning
☆26 · Updated 2 years ago
goel-shashank / CyCLIP
☆120 · Updated 2 years ago
EPFLiGHT / MultiModN
MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023)
☆34 · Updated 2 years ago
AI4LIFE-GROUP / SpLiCE
Sparse Linear Concept Embeddings
☆116 · Updated 7 months ago
divyam3897 / I2M2
I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)
☆22 · Updated last year
yuhui-zh15 / drml
Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)
☆34 · Updated 2 years ago