YehLi / xmodalerView on GitHub
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
970Feb 27, 2023Updated 3 years ago

Alternatives and similar repositories for xmodaler

Users that are interested in xmodaler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?