hasanar1f / HiRED

HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Vision-Language Models (e.g., LLaVA-Next) under a fixed token budget.
13 · Updated 2 months ago
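
The idea of dropping visual tokens under a fixed budget can be illustrated with a minimal sketch. This is not HiRED's actual code or API: the function names are hypothetical, and the per-token importance scores are assumed to be supplied by the caller (the description above does not say how they are computed). The sketch keeps the highest-scoring tokens up to the budget and preserves their original order.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def select_tokens_under_budget(scores: Sequence[float], budget: int) -> List[int]:
    """Return indices of the `budget` highest-scoring tokens, in original order.

    `scores` is a hypothetical per-token importance score (an assumption of
    this sketch, not something taken from the HiRED repository).
    """
    if budget < 0:
        raise ValueError("budget must be non-negative")
    budget = min(budget, len(scores))
    # Rank indices by descending score; sorted() is stable, so ties keep
    # the lower index first.
    ranked = sorted(range(len(scores)), key=lambda i: -scores[i])
    # Re-sort the kept indices so the surviving tokens stay in sequence order.
    return sorted(ranked[:budget])


def drop_tokens(tokens: Sequence[T], scores: Sequence[float], budget: int) -> List[T]:
    """Keep at most `budget` tokens, dropping the lowest-scoring ones."""
    if len(tokens) != len(scores):
        raise ValueError("tokens and scores must have the same length")
    return [tokens[i] for i in select_tokens_under_budget(scores, budget)]


if __name__ == "__main__":
    visual_tokens = ["t0", "t1", "t2", "t3"]
    importance = [0.1, 0.9, 0.3, 0.7]
    print(drop_tokens(visual_tokens, importance, budget=2))
```

With a budget of 2, only the two highest-scoring tokens are passed on, so the downstream language model processes a shorter visual sequence regardless of the input image resolution.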

Related projects

Alternatives and complementary repositories for HiRED