Vision-CAIR / affectiveVisDial
☆12 · Updated 6 months ago
Alternatives and similar repositories for affectiveVisDial:
Users who are interested in affectiveVisDial are comparing it to the repositories listed below.
- Explainable Multimodal Emotion Reasoning (EMER) and AffectGPT ☆125 · Updated 8 months ago
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario… ☆48 · Updated 4 months ago
- The official implementation of the paper S2-VER: Semi-Supervised Visual Emotion Recognition ☆11 · Updated 8 months ago
- The Social-IQ 2.0 Challenge Release for the Artificial Social Intelligence Workshop at ICCV '23 ☆22 · Updated last year
- A Prompted Visual Hallucination Evaluation Dataset, featuring over 100,000 data points and four advanced evaluation modes. The dataset in… ☆11 · Updated last month
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning ☆22 · Updated 3 months ago
- Official repository for "eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos" ☆32 · Updated 7 months ago
- Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral) ☆13 · Updated 6 months ago
- Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20. ☆51 · Updated last year
- GPT-4V with Emotion ☆89 · Updated last year
- [CVPR 2023] Code for "Learning Emotion Representations from Verbal and Nonverbal Communication" ☆43 · Updated last year
- Repo for the EMNLP 2023 paper "A Simple Knowledge-Based Visual Question Answering" ☆21 · Updated last year
- [ACL2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information ☆12 · Updated 2 months ago
- A PyTorch implementation of EmpiricalMVM ☆39 · Updated last year
- Official repository for the A-OKVQA dataset ☆69 · Updated 8 months ago
- [CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The… ☆52 · Updated 6 months ago
- Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023) ☆97 · Updated last year
- This is the official implementation of 2023 ICCV paper "EmoSet: A large-scale visual emotion dataset with rich attributes". ☆41 · Updated 9 months ago
- [ACM MM 2022 Oral] This is the official implementation of "SER30K: A Large-Scale Dataset for Sticker Emotion Recognition" ☆22 · Updated 2 years ago
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23… ☆56 · Updated 3 months ago
- The official implementation of ECCV2024 paper "Facial Affective Behavior Analysis with Instruction Tuning" ☆21 · Updated last week
- NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media, EMNLP 2021 ☆38 · Updated 4 months ago
- PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral) ☆121 · Updated last year
- [CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion m… ☆58 · Updated 7 months ago
- Training A Small Emotional Vision Language Model for Visual Art Comprehension ☆14 · Updated 5 months ago