scofield7419 / EmpathyEarLinks
Multimodal Empathetic Chatbot
☆42Updated last year
Alternatives and similar repositories for EmpathyEar
Users that are interested in EmpathyEar are comparing it to the libraries listed below
Sorting:
- ☆15Updated 2 years ago
- GPT-4V with Emotion☆93Updated last year
- HumanOmni☆189Updated 5 months ago
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23…☆56Updated 9 months ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆14Updated 9 months ago
- Implementation for the paper "Can Language Models Learn to Listen?"☆65Updated last year
- ☆75Updated 5 months ago
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…☆54Updated 11 months ago
- [Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition☆111Updated 9 months ago
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆48Updated last month
- [ICCV 2025] Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges☆71Updated 5 months ago
- A project for tri-modal LLM benchmarking and instruction tuning.☆42Updated 4 months ago
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆47Updated 2 weeks ago
- [ACM ICMR'25]Official repository for "eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos"☆33Updated 3 weeks ago
- Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization"☆79Updated 2 weeks ago
- ☆14Updated last month
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆20Updated last week
- NeurIPS'2023 official implementation code☆65Updated last year
- Narrative movie understanding benchmark☆74Updated 2 months ago
- OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Rea…☆93Updated last month
- Explainable Multimodal Emotion Reasoning (EMER), OV-MER (ICML), and AffectGPT (ICML, Oral)☆221Updated this week
- Graph learning framework for long-term video understanding☆65Updated 3 weeks ago
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Updated 6 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆48Updated last year
- ☆187Updated last year
- LMM solved catastrophic forgetting, AAAI2025☆44Updated 3 months ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Updated 5 months ago
- This is the official implementation of 2024 CVPR paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models".☆86Updated 6 months ago
- Video dataset dedicated to portrait-mode video recognition.☆52Updated 8 months ago
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning☆34Updated 3 months ago