Vision-CAIR / affectiveVisDial
☆11 · Updated 3 months ago
Related projects
Alternatives and complementary repositories for affectiveVisDial
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos ☆30 · Updated 7 months ago
- Explainable Multimodal Emotion Reasoning (EMER) and AffectGPT ☆119 · Updated 6 months ago
- [CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The… ☆51 · Updated 4 months ago
- ☆46 · Updated 3 months ago
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario… ☆39 · Updated 2 months ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" NeurIPS 23 Spotlight ☆35 · Updated last year
- The official implementation of the paper S2-VER: Semi-Supervised Visual Emotion Recognition ☆11 · Updated 6 months ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight) ☆58 · Updated 4 months ago
- Training A Small Emotional Vision Language Model for Visual Art Comprehension ☆13 · Updated 3 months ago
- ☆55 · Updated last year
- Repo for the EMNLP 2023 paper "A Simple Knowledge-Based Visual Question Answering" ☆20 · Updated 10 months ago
- ☆17 · Updated 4 months ago
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding" ☆70 · Updated 5 months ago
- Official repository for "eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos" ☆31 · Updated 5 months ago
- Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023) ☆91 · Updated last year
- A PyTorch implementation of EmpiricalMVM ☆39 · Updated 10 months ago
- ☆60 · Updated last year
- Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos ☆34 · Updated 6 months ago
- ☆25 · Updated this week
- [NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering ☆178 · Updated 9 months ago
- ☆30 · Updated last month
- [CVPR 2023] Code for "Learning Emotion Representations from Verbal and Nonverbal Communication" ☆39 · Updated last year
- [CVPR'23 Highlight] AutoAD: Movie Description in Context. ☆87 · Updated this week
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning ☆37 · Updated 2 months ago
- The official implementation of the ECCV 2024 paper "Facial Affective Behavior Analysis with Instruction Tuning" ☆16 · Updated 2 weeks ago
- [CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion m… ☆56 · Updated 5 months ago
- ☆18 · Updated last month
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation" ☆18 · Updated 11 months ago
- ☕️ CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion ☆27 · Updated 4 months ago
- Official repository for DiffCap: Exploring Continuous Diffusion on Image Captioning ☆7 · Updated last year