ufal / MLASK
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for MLASK
- NAACL 2022 paper on Analyzing Modality Robustness in Multimodal Sentiment Analysis☆31Updated last year
- ACM MM '22: Unified Multi-modal Pre-training for Few-shot Sentiment Analysis with Prompt-based Learning☆15Updated last year
- Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20.☆49Updated last year
- Video Feature Extractor for S3D-HowTo100M☆28Updated 3 years ago
- ☆13Updated 3 weeks ago
- [ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation"☆39Updated 3 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆86Updated 3 years ago
- CLUE code☆12Updated 2 years ago
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆13Updated last year
- This repository contains the implementation of the paper -- Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal Sentiment An…☆65Updated last year
- ☆21Updated last year
- Pytorch implementation for Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition☆55Updated 2 years ago
- ☆18Updated 3 years ago
- Video Graph Transformer for Video Question Answering (ECCV'22)☆45Updated last year
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)☆71Updated last year
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos☆30Updated 7 months ago
- ☆48Updated 5 years ago
- This is the code for Coupled-translation Fusion Network.☆9Updated 2 years ago
- 😎 All your need for future is FollowGPT.☆12Updated last year
- MUSIC-AVQA, CVPR2022 (ORAL)☆67Updated last year
- Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition☆30Updated 3 years ago
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆11Updated last year
- source code for ICASSP 2022 paper: EmotionFlow: Capture the Dialogue Level Emotion Transitions☆26Updated 2 years ago
- DeVLBert: Learning Deconfounded Visio-Linguistic Representations☆27Updated last year
- MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)☆78Updated 11 months ago
- ☆21Updated 7 months ago
- ☆7Updated last year
- ☆28Updated 2 years ago
- 16k Hz Vocoder (HiFiGAN Codes and Pretrained Models)☆16Updated last year
- Learning Interactions and Relationships between Movie Characters (CVPR'20)☆21Updated last year