foundation-multimodal-models / CAL
[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
☆57Updated 5 months ago
Alternatives and similar repositories for CAL:
Users that are interested in CAL are comparing it to the libraries listed below
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models