google-research-datasets / conceptual-12m
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.
418 · Jul 14, 2025 · Updated 7 months ago

Alternatives and similar repositories for conceptual-12m

Users who are interested in conceptual-12m are comparing it to the repositories listed below.
