li-xirong / hierse
Zero-shot image tagging by hierarchical semantic embedding
☆77Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for hierse
- Code release for Hu et al. Natural Language Object Retrieval, in CVPR, 2016☆113Updated 8 years ago
- ☆79Updated 8 years ago
- Code for detecting visual concepts in images.☆151Updated 6 years ago
- Self-supervised learning of visual features through embedding images into text topic spaces☆95Updated 2 years ago
- Code for paper "Image Caption Generation with Text-Conditional Semantic Attention"☆61Updated 7 years ago
- Code release for Hu et al. Segmentation from Natural Language Expressions. in ECCV, 2016☆86Updated 7 years ago
- Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17☆164Updated 5 years ago
- ☆16Updated 7 years ago
- Generate captions for an image using PyTorch☆128Updated 7 years ago
- Tensorflow Implementation of Deeper LSTM+ normalized CNN for Visual Question Answering☆100Updated 7 years ago
- Faster-RCNN based on Densecap(deprecated)☆85Updated 8 years ago
- A PyTorch implementation of the paper "Semantic Image Synthesis via Adversarial Learning" in ICCV 2017☆145Updated 7 years ago
- code for Stacked attention networks for image question answering☆107Updated 7 years ago
- Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization☆62Updated 5 years ago
- Learning to Evaluate Image Captioning. CVPR 2018☆83Updated 6 years ago
- Author's implementation of 'Unsupervised Visual Representation Learning by Context Prediction'☆115Updated 7 years ago
- Simple Baseline for Visual Question Answering☆186Updated 7 years ago
- DPPnet: Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction☆95Updated 8 years ago
- Image Captioning with Deep Bidirectional LSTMs☆86Updated 5 months ago
- Image Caption and Text to Image papers.☆68Updated 6 years ago
- Code for ECCV 2016 paper, Taxonomy-Regularized Semantic Deep Convolutional Neural Networks☆25Updated 8 years ago
- Torch implementation for Stacked Attention Networks☆24Updated 7 years ago
- Visual Question Answering Project with state of the art single Model performance.☆132Updated 6 years ago
- ☆21Updated 8 years ago
- Multimodal Compact Bilinear Pooling for Torch7☆68Updated 7 years ago
- Multimodal Residual Learning for Visual QA (NIPS 2016)☆39Updated 7 years ago
- Hadamard Product for Low-rank Bilinear Pooling☆71Updated 7 years ago
- Contains approaches introduced in the MovieQA benchmark dataset paper☆80Updated 7 years ago