NVlabs / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
1,968Updated last week

Related projects

Alternatives and complementary repositories for VILA