NVlabs / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
1,999Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for VILA