ivendrov / order-embedding
Implementation of caption-image retrieval from the paper "Order-Embeddings of Images and Language"
☆185Updated 8 years ago
Alternatives and similar repositories for order-embedding:
Users that are interested in order-embedding are comparing it to the libraries listed below
- ☆78Updated 8 years ago
- Code release for Hu et al. Natural Language Object Retrieval, in CVPR, 2016☆112Updated 8 years ago
- Simple Baseline for Visual Question Answering☆186Updated 8 years ago
- Review Network for Caption Generation☆181Updated 7 years ago
- Coherence + Recurrent Neural Network + Convolutional Neural Network☆142Updated 8 years ago
- Code for detecting visual concepts in images.☆150Updated 7 years ago
- Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17☆163Updated 6 years ago
- Porting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7☆148Updated 5 years ago
- Re-implementation of the m-RNN model using TensorFLow☆108Updated 8 years ago
- code for Stacked attention networks for image question answering☆108Updated 8 years ago
- A bare-bones NumPy implementation of "Multimodal Neural Language Models" (Kiros et al, ICML 2014)☆55Updated 8 years ago
- Code for Structured Attention Networks https://arxiv.org/abs/1702.00887☆238Updated 8 years ago
- ☆219Updated 8 years ago
- Self-supervised learning of visual features through embedding images into text topic spaces☆94Updated 2 years ago
- Multimodal Residual Learning for Visual QA (NIPS 2016)☆38Updated 8 years ago
- DPPnet: Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction☆94Updated 9 years ago
- Hadamard Product for Low-rank Bilinear Pooling☆70Updated 7 years ago
- Contains approaches introduced in the MovieQA benchmark dataset paper☆79Updated 8 years ago
- Implementation of CVPR 2016 paper☆75Updated 4 years ago
- Mixed Incremental Cross-Entropy REINFORCE ICLR 2016☆331Updated 8 years ago
- Visual7W visual question answering models☆63Updated 5 years ago
- Code and models from the paper "Layer Normalization"☆247Updated 8 years ago
- Tensorflow Implementation of Deeper LSTM+ normalized CNN for Visual Question Answering☆99Updated 8 years ago
- The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283☆164Updated 8 years ago
- ☆69Updated 6 years ago
- Implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural Language M…☆426Updated 8 years ago
- Multi-modal features toolkit in Python☆79Updated 5 years ago
- Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017☆271Updated 4 years ago
- Visual Question Answering Project with state of the art single Model performance.☆131Updated 6 years ago
- Code to accompany the paper "Learning Graphical State Transitions"☆170Updated 7 years ago