darthgera123 / Multimodal-SummarizationLinks
Summarization of Multimodal articles
☆10Updated 3 years ago
Alternatives and similar repositories for Multimodal-Summarization
Users that are interested in Multimodal-Summarization are comparing it to the libraries listed below
Sorting:
- A Full-Scale Dataset for Multi-modal Summarization☆16Updated 4 years ago
- Code for the paper Multimodal Abstractive Summarization with Trimodal Hierarchical Attention☆20Updated 4 years ago
- PyTorch implementation of Image captioning with Bottom-up, Top-down Attention☆168Updated 7 years ago
- Code base for the paper "Latent variable model for multi-modal translation".☆17Updated last year
- PyTorch bottom-up attention with Detectron2☆239Updated 4 years ago
- The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".☆56Updated 4 years ago
- A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.☆48Updated 4 years ago
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Updated 4 years ago
- BERT + Image Captioning☆135Updated 5 years ago
- Pytorch implementation of "A Deep Reinforced Model for Abstractive Summarization" paper and pointer generator network☆321Updated 6 years ago
- Code for our Paper, 'Summaformers @ LaySumm 20, LongSumm 20' at EMNLP 2020, Scholarly Document Processing Workshop☆12Updated 5 years ago
- An implementation that downstreams pre-trained V+L models to VQA tasks. Now support: VisualBERT, LXMERT, and UNITER☆165Updated 3 years ago
- A self-evident application of the VQA task is to design systems that aid blind people with sight reliant queries. The VizWiz VQA dataset …☆15Updated 2 years ago
- ☆67Updated 3 years ago
- Show and Tell : A Neural Image Caption Generator☆112Updated 5 years ago
- text generation from keywords using transformer model☆12Updated 6 years ago
- Image Captioning based on Bottom-Up and Top-Down Attention model☆104Updated 7 years ago
- Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos☆12Updated 5 years ago
- Dataset and codes for our IJCAI 2019 paper "Adapting BERT for Target-Oriented Multimodal Sentiment Classification"☆86Updated 5 years ago
- This repository gives a GUI using PyQt4 for VQA demo using Keras Deep Learning Library. The VQA model is created using Pre-trained VGG-1…☆46Updated 4 years ago
- Image Captioning: Implementing the Neural Image Caption Generator☆21Updated 5 years ago
- CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering☆77Updated 6 years ago
- A PyTorch reimplementation of bottom-up-attention models☆302Updated 3 years ago
- code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`☆11Updated 5 years ago
- Code for paper "Attention on Attention for Image Captioning". ICCV 2019☆339Updated 4 years ago
- Transformer-based image captioning extension for pytorch/fairseq☆318Updated 5 years ago
- ☆15Updated 5 years ago
- Subjective Image Captioning using Capsule Generative Adversarial Network☆11Updated 4 years ago
- A GCN based visual question generation model☆13Updated 6 years ago
- ROCK model for Knowledge-Based VQA in Videos☆31Updated 5 years ago