apugoneappu / ask_me_anything
An easy-to-use app to visualise attentions of various VQA models.
☆41Updated 2 years ago
Alternatives and similar repositories for ask_me_anything
Users who are interested in ask_me_anything are comparing it to the libraries listed below
- [CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations☆563Updated last year
- Generic framework for ML projects☆19Updated 2 years ago
- ML/DL meeting group at IIT Kharagpur☆47Updated 4 years ago
- Labs for the course on Meta Learning at BITS-Goa☆34Updated 4 years ago
- Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)☆466Updated 4 years ago
- PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning☆169Updated 6 years ago
- CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering☆75Updated 5 years ago
- Starter code in PyTorch for the Visual Dialog challenge☆191Updated 2 years ago
- Learn to build your neural network using PyTorch☆42Updated 6 years ago
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog☆47Updated 5 years ago
- A collection of multimodal datasets and visual features for VQA and captioning in PyTorch. Just run "pip install multimodal"☆82Updated 3 years ago
- Neural Turing Machines in Pytorch.☆46Updated 6 years ago
- Implementation of the Object Relation Transformer for Image Captioning☆178Updated 9 months ago
- PyTorch bottom-up attention with Detectron2☆233Updated 3 years ago
- Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379☆97Updated 5 years ago
- Python app to help you become prudent in your spendings☆30Updated 4 years ago
- Pytorch VQA : Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf)☆95Updated last year
- Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".☆75Updated last year
- Grid features pre-training code for visual question answering☆269Updated 3 years ago
- MERLOT: Multimodal Neural Script Knowledge Models☆224Updated 3 years ago
- Transformer-based image captioning extension for pytorch/fairseq☆317Updated 4 years ago
- Official EvalAI Command Line Tool☆55Updated 2 months ago
- A tool to prepare for GRE using command line terminal. Build in process.☆8Updated 4 years ago
- The Easy Visual Question Answering dataset.☆33Updated last year
- Attention-based Visual Question Answering in Torch☆100Updated 7 years ago
- Code for our paper: *Shamsian, *Kleinfeld, Globerson & Chechik, "Learning Object Permanence from Video"☆68Updated 7 months ago
- generate captions for images using a CNN-RNN model that is trained on the Microsoft Common Objects in COntext (MS COCO) dataset☆80Updated 7 years ago
- Strong baseline for visual question answering☆240Updated 2 years ago
- Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"☆21Updated 4 years ago
- Visualize synonyms and common confusing words in an interactive network☆53Updated 5 years ago