yaohungt / Multimodal-Transformer
[ACL'19] [PyTorch] Multimodal Transformer
☆860Updated 2 years ago
Alternatives and similar repositories for Multimodal-Transformer:
Users that are interested in Multimodal-Transformer are comparing it to the libraries listed below
- This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as mul…☆806Updated last year
- This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018☆261Updated 4 years ago
- ☆199Updated 3 years ago
- MMSA is a unified framework for Multimodal Sentiment Analysis.☆745Updated last month
- [NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning☆519Updated last year
- MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis☆218Updated last year
- Attention-based multimodal fusion for sentiment analysis☆339Updated 10 months ago
- Pytorch Implementation of Tensor Fusion Networks for multimodal sentiment analysis.☆185Updated 4 years ago
- Codes for paper "Learning Modality-Specific Representations with Self-Supervised Multi-Task Learning for Multimodal Sentiment Analysis"☆206Updated 2 years ago
- [AAAI 2018] Memory Fusion Network for Multi-view Sequential Learning☆114Updated 4 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆788Updated 3 years ago
- This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information M…☆177Updated last year
- Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"☆1,436Updated 11 months ago
- This is a short tutorial for using the CMU-MultimodalSDK.☆82Updated 5 years ago
- Paper List for Multimodal Sentiment Analysis☆99Updated 4 years ago
- A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis☆121Updated last week
- A curated list of Multimodal Related Research.☆1,334Updated last year
- Code for ALBEF: a new vision-language pre-training method☆1,610Updated 2 years ago
- ☆220Updated last year
- ☆163Updated 4 years ago
- Multi Task Vision and Language☆804Updated 3 years ago
- A Tool for extracting multimodal features from videos.☆158Updated 2 years ago
- ☆179Updated last year
- Offical implementation of paper "MSAF: Multimodal Split Attention Fusion"☆79Updated 3 years ago
- BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models☆349Updated 5 years ago
- Recent Advances in Vision and Language PreTrained Models (VL-PTMs)☆1,151Updated 2 years ago
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".☆943Updated 2 years ago
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"☆532Updated last year
- Deep Modular Co-Attention Networks for Visual Question Answering☆450Updated 4 years ago
- M-SENA: All-in-One Platform for Multimodal Sentiment Analysis☆84Updated 2 years ago