IsaacRodgz / ConcatBERT

Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.
38Updated 2 years ago

Related projects

Alternatives and complementary repositories for ConcatBERT