TheoCoombes / ClipCap

Using pretrained encoder and language models to generate captions from multimedia inputs.
94Updated last year

Alternatives and similar repositories for ClipCap:

Users that are interested in ClipCap are comparing it to the libraries listed below