zwq2018 / Multi-modal-Self-instruct

The codebase for our EMNLP 2024 paper, "Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model"
67 · Updated last month

Alternatives and similar repositories for Multi-modal-Self-instruct:

Users who are interested in Multi-modal-Self-instruct are comparing it to the libraries listed below.