scofield7419 / EmpathyEar
Multimodal Empathetic Chatbot
☆29Updated 6 months ago
Alternatives and similar repositories for EmpathyEar:
Users that are interested in EmpathyEar are comparing it to the libraries listed below
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…☆48Updated 4 months ago
- Implementation for the paper "Can Language Models Learn to Listen?"☆61Updated last year
- GPT-4V with Emotion☆89Updated last year
- ☆15Updated last year
- Pre-trained model weights of MAE-Face.☆29Updated last year
- Data and Pytorch implementation of IEEE TMM "EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation"☆23Updated 10 months ago
- Official implementation of Faceptor: A Generalist Model for Face Perception.☆36Updated 5 months ago
- LMM which strictly superset LLM embedded☆37Updated 2 months ago
- Official implementation of MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control☆32Updated last week
- A project for tri-modal LLM benchmarking and instruction tuning.☆19Updated 2 months ago
- ☆10Updated 7 months ago
- ☆66Updated 2 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆47Updated 5 months ago
- ☆32Updated 4 months ago
- ☆14Updated 4 months ago
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation☆62Updated 8 months ago
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆39Updated last week
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆23Updated last year
- This is the official implementation of 2024 CVPR paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models".☆69Updated 2 weeks ago
- ☆31Updated 10 months ago
- Official repository for "eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos"☆32Updated 7 months ago
- ☆38Updated last month
- Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))☆40Updated 5 months ago
- NeurIPS'2023 official implementation code☆59Updated last year
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆35Updated 10 months ago
- ☆15Updated 2 years ago
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning☆23Updated 4 months ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆128Updated 4 months ago
- Data-Efficient Multimodal Fusion on a Single GPU☆52Updated 8 months ago
- EmoStyle project page☆37Updated 10 months ago