GT-Vision-Lab / VQA_LSTM_CNN

Train a deeper LSTM and normalized CNN Visual Question Answering model. The current code reaches 58.16 on OpenEnded and 63.09 on Multiple-Choice on test-standard.
376 · Updated 5 years ago

Related projects

Alternatives and complementary repositories for VQA_LSTM_CNN