goodwillyoga / Flickr8k_dataset
☆22Updated 6 years ago
Alternatives and similar repositories for Flickr8k_dataset
Users that are interested in Flickr8k_dataset are comparing it to the libraries listed below
Sorting:
- In-the-wild Question Answering☆15Updated 2 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"☆28Updated 2 years ago
- The Implementation for the Paper "Time-Stamped Language Model: Teaching Language Models toUnderstand The Flow of Events"☆11Updated 4 years ago
- Official Github Repo for the Findings of EMNLP 2021 paper "An animated picture says at least a thousand words: Selecting Gif-based Replie…☆32Updated 3 years ago
- ☆44Updated 3 years ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 2 years ago
- Github repository for Plot and Rework: Modeling Storylines for Visual Storytelling (ACL-IJCNLP2021 Findings)☆21Updated 2 years ago
- Code for paper "Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling"☆7Updated 5 years ago
- The code and output of our AAAI paper "Knowledge-Enriched Visual Storytelling"☆40Updated 4 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆74Updated 2 years ago
- ☆53Updated 3 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Updated 3 years ago
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆39Updated last year
- ☆11Updated 2 years ago
- Implementation of Multistream Transformers in Pytorch☆54Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 4 years ago
- This repository contains the PyTorch implementation of the paper STaCK: Sentence Ordering with Temporal Commonsense Knowledge appearing a…☆28Updated 2 years ago
- Procedural Reasoning Networks☆7Updated 4 years ago
- Humor Knowledge Enriched Transformer☆30Updated 3 years ago
- companion code for "Learning to substitute Ingredients in Recipes"☆26Updated last year
- Visual Storytelling with Cross-Modal Rules☆7Updated 5 years ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 5 months ago
- TVRecap: A Dataset for Generating Stories with Character Descriptions☆20Updated last year
- bumble bee transformer☆14Updated 4 years ago
- Screenplay Summarization using Latent Narrative Structure☆37Updated 2 years ago
- ☆40Updated last year
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆33Updated 2 years ago
- Source code of our MM'22 paper Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning☆13Updated last year