BetterZH / SEVLM-code
Training A Small Emotional Vision Language Model for Visual Art Comprehension
☆16Updated 9 months ago
Alternatives and similar repositories for SEVLM-code
Users that are interested in SEVLM-code are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] Context-Guided Spatio-Temporal Video Grounding☆54Updated 10 months ago
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆16Updated 10 months ago
- Composed Video Retrieval☆57Updated last year
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…☆52Updated 8 months ago
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆47Updated 9 months ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆34Updated last year
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention