TXH-mercury / VAST

[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
272Updated last year

Alternatives and similar repositories for VAST:

Users that are interested in VAST are comparing it to the libraries listed below