lucidrains / MaMMUT-pytorch

Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in PyTorch
☆98 Updated last year

Alternatives and similar repositories for MaMMUT-pytorch:

Users who are interested in MaMMUT-pytorch are comparing it to the libraries listed below