ufal / MLASK
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆12Updated last year
Alternatives and similar repositories for MLASK
Users who are interested in MLASK are comparing it to the repositories listed below
- NAACL 2022 paper on Analyzing Modality Robustness in Multimodal Sentiment Analysis☆31Updated 2 years ago
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆14Updated 2 years ago
- [ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation☆40Updated 4 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆10Updated 2 years ago
- ACM MM '22: Unified Multi-modal Pre-training for Few-shot Sentiment Analysis with Prompt-based Learning☆17Updated 2 years ago
- MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)☆100Updated 3 months ago
- Humor Knowledge Enriched Transformer☆30Updated 3 years ago
- source code for ICASSP 2022 paper: EmotionFlow: Capture the Dialogue Level Emotion Transitions☆27Updated 3 years ago
- Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20.☆54Updated 2 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆52Updated 3 years ago
- code for "Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation, EMNLP 22"☆77Updated 2 years ago
- PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)☆125Updated 2 years ago
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answering…☆27Updated 2 years ago
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)☆78Updated 2 years ago
- Official repo for the paper "Multimodal Phased Transformer for Sentiment Analysis".☆21Updated 2 months ago
- CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis (MM 2020)☆113Updated 4 years ago
- [COLING 2022] Learning from Adjective-Noun Pairs: A Knowledge-enhanced Framework for Target-Oriented Multimodal Sentiment Classification☆14Updated 2 years ago
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos☆33Updated 6 months ago
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆77Updated last month
- PyTorch implementation for Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition☆61Updated 2 years ago
- Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition☆30Updated 4 years ago
- Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing☆17Updated 2 years ago
- Multi-Scale Attention for Audio Question Answering☆28Updated 2 years ago
- PyTorch implementation of the paper "Distilling Audio-Visual Knowledge by Compositional Contrastive Learning" (CVPR 2021)☆89Updated 4 years ago
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆16Updated 2 months ago
- This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information M…☆189Updated 2 years ago