scofield7419 / EmpathyEar
Multimodal Empathetic Chatbot
☆39Updated 10 months ago
Alternatives and similar repositories for EmpathyEar
Users that are interested in EmpathyEar are comparing it to the libraries listed below
Sorting:
- GPT-4V with Emotion☆92Updated last year
- ☆15Updated 10 months ago
- OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Rea…☆49Updated this week
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Updated 3 months ago
- a fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.☆14Updated last month
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…☆53Updated 8 months ago
- Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))☆44Updated 9 months ago
- ☆15Updated 2 years ago
- Explainable Multimodal Emotion Reasoning (EMER), Open-vocabulary MER (OV-MER), and AffectGPT☆166Updated last week
- Implementation for the paper "Can Language Models Learn to Listen?"☆65Updated last year
- LMM solved catastrophic forgetting, AAAI2025☆42Updated last month
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆14Updated last month
- Narrative movie understanding benchmark☆70Updated last year
- HumanOmni☆161Updated 2 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆81Updated last year
- [ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, …☆111Updated last month
- A comprehensive overview of affective computing research in the era of large language models (LLMs).☆22Updated 9 months ago
- ☆48Updated 10 months ago
- ☆18Updated 4 months ago
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning☆31Updated 3 weeks ago
- Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges☆67Updated 2 months ago
- [ACL2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆12Updated 6 months ago
- ☆44Updated last month
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆45Updated 3 months ago
- A project for tri-modal LLM benchmarking and instruction tuning.☆34Updated last month
- Official repo for StableLLAVA☆95Updated last year
- This is the official implementation of 2024 CVPR paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models".☆81Updated 4 months ago
- ☆86Updated 8 months ago
- ☆21Updated 2 weeks ago
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆69Updated 3 months ago