CMU-MultiComp-Lab / mmml-courseLinks

☆93

Alternatives and similar repositories for mmml-course

Users that are interested in mmml-course are comparing it to the libraries listed below

Sorting:

CMU-MultiComp-Lab / adv-mmml-course
☆38Updated last year
CMU-MultiComp-Lab / mmml-tutorial
☆30Updated 2 years ago
pliang279 / HighMMT
[TMLR 2022] High-Modality Multimodal Transformer
☆117Updated last year
slds-lmu / seminar_multimodal_dl
https://slds-lmu.github.io/seminar_multimodal_dl/
☆171Updated 2 years ago
dsaidgovsg / multimodal-learning-hands-on-tutorial
☆101Updated 3 years ago
marslanm / Multimodality-Representation-Learning
This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…
☆81Updated 4 months ago
sayakpaul / robustness-foundation-models
This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.
☆72Updated 2 years ago
pliang279 / MultiViz
[ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models
☆97Updated last year
zhjohnchan / awesome-vision-and-language-pretraining
A curated list of vision-and-language pre-training (VLP). :-)
☆59Updated 3 years ago
vincentlux / Awesome-Multimodal-LLM
Reading list for Multimodal Large Language Models
☆68Updated 2 years ago
dondongwon / LPMDataset
☆46Updated 2 years ago
EdisonLeeeee / ICLR2023-OpenReviewData
ICLR 2023 Paper submission analysis from https://openreview.net/group?id=ICLR.cc/2023/Conference
☆106Updated 3 years ago
albanie / foundation-models
Video descriptions of research papers relating to foundation models and scaling
☆30Updated 2 years ago
calpt / awesome-adapter-resources
Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning
☆199Updated last year
google-research / head2toe
☆81Updated last year
The-AI-Summer / pytorch-ddp
code for the ddp tutorial
☆32Updated 3 years ago
multimodal / multimodal
A collection of multimodal datasets, and visual features for VQA and captionning in pytorch. Just run "pip install multimodal"
☆83Updated 3 years ago
Toloka / WSDMCup2023
Toloka Visual Question Answering Challenge at WSDM Cup 2023
☆31Updated last year
ekinakyurek / google-research
Google Research
☆46Updated 3 years ago
martenlienen / icml-neurips-iclr-dataset
Papers, authors and author affiliations from ICML, NeurIPS and ICLR 2006-2024
☆43Updated 6 months ago
microsoft / BridgeTower
Open source code for AAAI 2023 Paper "BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning"
☆166Updated 2 years ago
junchen14 / Multi-Modal-Transformer
The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-languag…
☆230Updated 3 years ago
weigq / neurips2021_stats
☆35Updated 3 years ago
paperswithcode / tutorials
Basic guidance on how to contribute to Papers with Code
☆24Updated 3 years ago
MichiganNLP / In-the-wild-QA
In-the-wild Question Answering
☆15Updated 2 years ago
jacobmarks / awesome-neurips-2023
Conference schedule, top papers, and analysis of the data for NeurIPS 2023!
☆121Updated last year
lucidrains / tableformer-pytorch
Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch
☆39Updated 3 years ago
IntelLabs / VL-InterpreT
Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers
☆97Updated 2 years ago
google-deepmind / multimodal_transformers
☆64Updated 3 years ago
goel-shashank / CyCLIP
☆120Updated 2 years ago