NVlabs / VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
2,835 · Updated this week

Alternatives and similar repositories for VILA:

Users who are interested in VILA are comparing it to the libraries listed below.