lucidrains / MaMMUT-pytorch

Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch
β˜†97Updated last year

Related projects β“˜

Alternatives and complementary repositories for MaMMUT-pytorch