yaohungt / Multimodal-TransformerLinks
[ACL'19] [PyTorch] Multimodal Transformer
☆890Updated 2 years ago
Alternatives and similar repositories for Multimodal-Transformer
Users that are interested in Multimodal-Transformer are comparing it to the libraries listed below
Sorting:
- This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as mul…☆847Updated 2 years ago
- This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018☆264Updated 5 years ago
- ☆204Updated 3 years ago
- [NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning☆556Updated last year
- MMSA is a unified framework for Multimodal Sentiment Analysis.☆819Updated 5 months ago
- MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis☆240Updated 2 years ago
- Codes for paper "Learning Modality-Specific Representations with Self-Supervised Multi-Task Learning for Multimodal Sentiment Analysis"☆215Updated 3 years ago
- Pytorch Implementation of Tensor Fusion Networks for multimodal sentiment analysis.☆189Updated 5 years ago
- Attention-based multimodal fusion for sentiment analysis☆355Updated last year
- [AAAI 2018] Memory Fusion Network for Multi-view Sequential Learning☆114Updated 4 years ago
- This is a short tutorial for using the CMU-MultimodalSDK.☆86Updated 6 years ago
- This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information M…☆187Updated 2 years ago
- A curated list of Multimodal Related Research.☆1,355Updated last year
- BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models☆352Updated 5 years ago
- Paper List for Multimodal Sentiment Analysis☆101Updated 4 years ago
- Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"☆1,471Updated last year
- Recent Advances in Vision and Language PreTrained Models (VL-PTMs)☆1,153Updated 2 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆793Updated 3 years ago
- Multi-Modal Transformer for Video Retrieval☆260Updated 8 months ago
- ☆190Updated 2 years ago
- A Tool for extracting multimodal features from videos.☆172Updated 2 years ago
- The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)☆275Updated 5 months ago
- The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.☆121Updated 3 years ago
- Offical implementation of paper "MSAF: Multimodal Split Attention Fusion"☆82Updated 4 years ago
- ☆172Updated 5 years ago
- ☆268Updated last year
- 🔆 📝 A reading list focused on Multimodal Emotion Recognition (MER) 👂👄 👀 💬☆121Updated 4 years ago
- M-SENA: All-in-One Platform for Multimodal Sentiment Analysis☆89Updated 3 years ago
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".☆954Updated 2 years ago
- A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis☆125Updated 4 months ago