liuzuyan / ElasticCache
[ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache
☆46Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for ElasticCache
- Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models☆129Updated 2 weeks ago
- 🚀 [NeurIPS24] Make Vision Matter in Visual-Question-Answering (VQA)! Introducing NaturalBench, a vision-centric VQA benchmark (NeurIPS'2…☆64Updated last week
- (NeurIPS 2024) Learning to Visual Question Answering, Asking and Assessment☆63Updated 2 weeks ago
- [NeurIPS'24] Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation☆65Updated last week
- An Information Flow Perspective for Exploring Large Vision Language Models on Reasoning Tasks☆59Updated 3 weeks ago
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models☆86Updated 7 months ago
- ☆85Updated 3 weeks ago
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆127Updated 3 months ago
- TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding [ACM MM'21]☆33Updated 2 years ago
- (ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator☆108Updated last month
- Mixed precision inference by Tensorrt-LLM