google-research-datasets / videoCC-data

VideoCC is a dataset of (video-URL, caption) pairs for training video-text machine learning models. It is created by an automatic pipeline that starts from the Conceptual Captions image-captioning dataset.
