google-research-datasets / wit

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
β˜†1,022Updated 4 months ago

Alternatives and similar repositories for wit:

Users that are interested in wit are comparing it to the libraries listed below