mlfoundations / MINT-1T

MINT-1T: A one-trillion-token multimodal interleaved dataset.
770 · Updated 3 months ago

Related projects

Alternatives and complementary repositories for MINT-1T