dvlab-research / LLaMA-VID

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
736Updated 3 months ago

Related projects

Alternatives and complementary repositories for LLaMA-VID