TheoCoombes / ClipCap

Using pretrained encoder and language models to generate captions from multimedia inputs.
94Updated 2 years ago

Alternatives and similar repositories for ClipCap:

Users that are interested in ClipCap are comparing it to the libraries listed below