Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking

[CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images.
20 · Updated 4 months ago

Related projects: