showlab / VisInContext

Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
12Updated last week

Related projects

Alternatives and complementary repositories for VisInContext