IsaacRodgz / multimodal-transformers-movies
Experiments with multimodal deep learning models based on transformers
☆12Updated 2 years ago
Alternatives and similar repositories for multimodal-transformers-movies:
Users that are interested in multimodal-transformers-movies are comparing it to the libraries listed below
- Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.☆68Updated 3 years ago
- Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction"☆30Updated 3 years ago
- Condensed Movies Challenge 2021☆17Updated 2 years ago
- ☆15Updated 4 years ago
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)☆75Updated last year
- EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"☆10Updated last year
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆85Updated 3 years ago
- Source code of our MM'22 paper Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning☆13Updated last year
- A Python Library for Multimodal Analysis of Movies and Content-based Movie Recommendation☆28Updated 3 years ago
- Official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval". CVPR 2022☆98Updated 2 years ago
- Multi-modal transformer approach for natural language query based joint video summarization and highlight detection☆13Updated 8 months ago
- ☆31Updated 3 years ago
- This repository contains the implementation of the paper -- Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal Sentiment An…☆68Updated last year
- [ACM MM 2022] Modality-aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection☆31Updated 2 years ago
- This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information M…☆171Updated last year
- [TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos☆38Updated 9 months ago
- Multimodal short video classification task, integrating video, image, audio and text modes for short video classification☆19Updated 4 years ago
- ☆54Updated 2 years ago
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)☆143Updated last year
- CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis(MM2020)☆109Updated 4 years ago
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Updated this week
- Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies☆51Updated 2 years ago
- [ACM MM 2021 Oral] Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation"☆39Updated 3 years ago
- Code for NAACL 2021 paper: MTAG: Modal-Temporal Attention Graph for Unaligned Human Multimodal Language Sequences☆42Updated last year
- Official code for "Audio-Guided Attention Network for Weakly Supervised Violence Detection" (ICCECE2022).☆13Updated 2 years ago
- Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answering…☆28Updated 2 years ago
- ☆199Updated 3 years ago
- PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).☆47Updated 3 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Updated 4 years ago
- Paper List for Multimodal Sentiment Analysis☆98Updated 4 years ago