CMU-MultiComp-Lab / mmml-tutorialLinks
☆30Updated 2 years ago
Alternatives and similar repositories for mmml-tutorial
Users that are interested in mmml-tutorial are comparing it to the libraries listed below
Sorting:
- ☆40Updated last year
- ☆95Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆68Updated last year
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆26Updated 3 months ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆72Updated 2 years ago
- ☆16Updated 2 years ago
- Video descriptions of research papers relating to foundation models and scaling☆30Updated 2 years ago
- A curated resources on what's happening in multimodal learning. Features recent papers, books, related lectures, and other relevant resou…☆16Updated 2 years ago
- I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)☆22Updated last year
- ☆100Updated 3 years ago
- https://slds-lmu.github.io/seminar_multimodal_dl/☆171Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- ☆49Updated 3 years ago
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023!☆121Updated last year
- [ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models☆98Updated last year
- Basic guidance on how to contribute to Papers with Code☆24Updated 3 years ago
- code for the ddp tutorial☆32Updated 3 years ago
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆43Updated 7 months ago
- ☆43Updated last year
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆81Updated 5 months ago
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆96Updated 2 years ago
- ☆32Updated 4 months ago
- [TMLR 2022] High-Modality Multimodal Transformer☆117Updated last year
- A curated list of vision-and-language pre-training (VLP). :-)☆59Updated 3 years ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆44Updated last week
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…☆14Updated last year
- ☆17Updated 5 months ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆78Updated 2 years ago
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆89Updated last year
- Neural Networks and Deep Learning, NUS CS5242, 2021☆191Updated 4 years ago