apugoneappu / ask_me_anything
An easy-to-use app to visualise attentions of various VQA models.
☆41 Updated 2 years ago
Alternatives and similar repositories for ask_me_anything:
Users who are interested in ask_me_anything are comparing it to the libraries listed below:
- [CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations ☆559 Updated last year
- A collection of multimodal datasets, and visual features for VQA and captioning in PyTorch. Just run "pip install multimodal" ☆81 Updated 2 years ago
- Labs for the course on Meta Learning at BITS-Goa ☆34 Updated 3 years ago
- ML/DL meeting group at IIT Kharagpur ☆47 Updated 4 years ago
- CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering ☆74 Updated 5 years ago
- PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning ☆169 Updated 6 years ago
- Code implementation for our ICPR 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings" ☆21 Updated 3 years ago
- Starter code in PyTorch for the Visual Dialog challenge ☆192 Updated last year
- Generic framework for ML projects ☆19 Updated 2 years ago
- Learn to build your neural network using PyTorch ☆43 Updated 5 years ago
- A tool to prepare for the GRE using the command line terminal. Build in progress. ☆8 Updated 3 years ago
- Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379 ☆96 Updated 4 years ago
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog ☆45 Updated 4 years ago
- ☆189 Updated 3 years ago
- Official EvalAI Command Line Tool ☆55 Updated 5 months ago
- Grid features pre-training code for visual question answering ☆268 Updated 3 years ago
- PyTorch bottom-up attention with Detectron2 ☆231 Updated 3 years ago
- Neural-symbolic visual question answering ☆262 Updated last year
- 🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle ☆88 Updated last year
- Entire Proxy Settings: One Script To Set Them All ☆42 Updated 9 months ago
- ☆28 Updated 6 years ago
- Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering ☆105 Updated 5 years ago
- ☆91 Updated 2 years ago
- Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019) ☆465 Updated 3 years ago
- Code for our paper: *Shamsian, *Kleinfeld, Globerson & Chechik, "Learning Object Permanence from Video" ☆68 Updated 2 months ago
- Implementation of the Object Relation Transformer for Image Captioning ☆177 Updated 4 months ago
- Code of Dense Relational Captioning ☆68 Updated last year
- Example of a Cover letter for AI Residency ☆80 Updated 5 years ago
- Neural Turing Machines in PyTorch. ☆46 Updated 6 years ago
- PyTorch VQA implementation that achieved top performances in the (ECCV18) VizWiz Grand Challenge: Answering Visual Questions from Blind P… ☆60 Updated 6 years ago