li-xirong / hierseLinks

Zero-shot image tagging by hierarchical semantic embedding

☆76

Alternatives and similar repositories for hierse

Users that are interested in hierse are comparing it to the libraries listed below

Sorting:

ronghanghu / natural-language-object-retrieval
Code release for Hu et al. Natural Language Object Retrieval, in CVPR, 2016
☆112Updated 8 years ago
mjhucla / mRNN-CR
☆78Updated 8 years ago
eladhoffer / captionGen
Generate captions for an image using PyTorch
☆127Updated 8 years ago
ronghanghu / text_objseg
Code release for Hu et al. Segmentation from Natural Language Expressions. in ECCV, 2016
☆85Updated 7 years ago
woozzu / dong_iccv_2017
A PyTorch implementation of the paper "Semantic Image Synthesis via Adversarial Learning" in ICCV 2017
☆143Updated 7 years ago
s-gupta / visual-concepts
Code for detecting visual concepts in images.
☆150Updated 7 years ago
jnhwkim / nips-mrn-vqa
Multimodal Residual Learning for Visual QA (NIPS 2016)
☆38Updated 8 years ago
markdtw / vqa-winner-cvprw-2017
Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17
☆163Updated 6 years ago
zcyang / imageqa-san
code for Stacked attention networks for image question answering
☆108Updated 8 years ago
chingyaoc / VQA-tensorflow
Tensorflow Implementation of Deeper LSTM+ normalized CNN for Visual Question Answering
☆99Updated 8 years ago
bernard24 / Embarrassingly-simple-ZSL
This repository contains the code for the real data experiments presented in our paper “An embarrassingly simple approach to zero-shot l…
☆68Updated 9 years ago
lluisgomez / TextTopicNet
Self-supervised learning of visual features through embedding images into text topic spaces
☆94Updated 2 years ago
zhoubolei / VQAbaseline
Simple Baseline for Visual Question Answering
☆186Updated 8 years ago
LuoweiZhou / e2e-gLSTM-sc
Code for paper "Image Caption Generation with Text-Conditional Semantic Attention"
☆60Updated 7 years ago
mjhucla / P-Multimodal-Dataset-Toolbox
☆69Updated 6 years ago
LisaAnne / DCC
Implementation of CVPR 2016 paper
☆75Updated 4 years ago
cdoersch / deepcontext
Author's implementation of 'Unsupervised Visual Representation Learning by Context Prediction'
☆115Updated 8 years ago
jnhwkim / MulLowBiVQA
Hadamard Product for Low-rank Bilinear Pooling
☆70Updated 7 years ago
makarandtapaswi / MovieQA_CVPR2016
Contains approaches introduced in the MovieQA benchmark dataset paper
☆79Updated 8 years ago
HyeonwooNoh / DPPnet
DPPnet: Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction
☆94Updated 9 years ago
deepsemantic / image_captioning
Image Captioning with Deep Bidirectional LSTMs
☆84Updated last year
jnhwkim / cbp
Multimodal Compact Bilinear Pooling for Torch7
☆69Updated 8 years ago
yanweifu / embedding_zero-shot-learning
☆20Updated 8 years ago
SinghJasdeep / Attention-on-Attention-for-VQA
Visual Question Answering Project with state of the art single Model performance.
☆131Updated 7 years ago
chingyaoc / san-torch
Torch implementation for Stacked Attention Networks
☆23Updated 8 years ago
bmsookim / wide-residual-network
Wide-residual network implementations. Best result for cifar10(97.12%), cifar100(84.12%), and other kaggle challenges
☆37Updated 8 years ago
yukezhu / visual7w-qa-models
Visual7W visual question answering models
☆63Updated 5 years ago
aviveise / 2WayNet
☆15Updated 7 years ago
watsonyanghx / Image-Text-Papers
Image Caption and Text to Image papers.
☆68Updated 7 years ago
therne / compact-bilinear-pooling-tf
Compact Bilinear Pooling (https://arxiv.org/abs/1511.06062) for TensorFlow
☆46Updated 5 years ago