google-research-datasets / conceptual-12mView on GitHub
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.
421Jul 14, 2025Updated 8 months ago

Alternatives and similar repositories for conceptual-12m

Users that are interested in conceptual-12m are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?