CMU-MultiComp-Lab / mmml-course
☆84Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for mmml-course
- ☆32Updated 6 months ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆70Updated last year
- [ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models☆91Updated 2 months ago
- [TMLR 2022] High-Modality Multimodal Transformer☆107Updated 2 weeks ago
- ☆33Updated 7 months ago
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆68Updated last year
- The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-languag…☆219Updated 2 years ago
- https://slds-lmu.github.io/seminar_multimodal_dl/☆163Updated last year
- In-the-wild Question Answering☆15Updated last year
- ☆81Updated 3 months ago
- ☆34Updated last year
- ☆84Updated 2 years ago
- Holistic evaluation of multimodal foundation models☆41Updated 3 months ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆66Updated last year
- Video descriptions of research papers relating to foundation models and scaling☆30Updated last year
- A Survey on multimodal learning research.☆315Updated last year
- Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning☆176Updated 6 months ago
- ☆31Updated 2 months ago
- code for the ddp tutorial☆32Updated 2 years ago
- MERLOT: Multimodal Neural Script Knowledge Models☆223Updated 2 years ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆21Updated 11 months ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 2 years ago
- ⚡Research papers about leveraging the capabilities of language models⚡☆52Updated last year
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria☆55Updated last month
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Updated 2 years ago
- ☆63Updated 2 years ago
- This repository is build in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and …☆253Updated 2 years ago
- This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…☆72Updated 7 months ago
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆95Updated last year
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning☆130Updated 2 years ago