StanfordMIMI / villa

ViLLA: Fine-grained vision-language representation learning from real-world data
40Updated last year

Related projects

Alternatives and complementary repositories for villa