qywu / FaceChatLinks
☆15Updated 2 years ago
Alternatives and similar repositories for FaceChat
Users that are interested in FaceChat are comparing it to the libraries listed below
Sorting:
- Multimodal Empathetic Chatbot☆55Updated last year
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Updated last year
- GPT-4V with Emotion☆96Updated 2 years ago
- ☆66Updated 2 years ago
- ☆20Updated 7 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆85Updated 2 years ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆15Updated last year
- An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Updated 2 years ago
- Code and Dataset for the paper "LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming" ACL …☆38Updated 2 years ago
- ☆71Updated 8 months ago
- A project for tri-modal LLM benchmarking and instruction tuning.☆56Updated 10 months ago
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆25Updated 8 months ago
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23…☆58Updated last year
- Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…☆124Updated 8 months ago
- A curated list of the papers, repositories, tutorials, and anythings related to the large language models for tools☆68Updated 2 years ago
- [ACM MM 2022]: Multi-Modal Experience Inspired AI Creation☆21Updated last year
- [ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model☆43Updated last year
- [ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models☆108Updated 8 months ago
- [TMLR23] Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.☆232Updated 2 years ago
- [ACM MM 2022 Oral] This is the official implementation of "SER30K: A Large-Scale Dataset for Sticker Emotion Recognition"☆29Updated 3 years ago
- A comprehensive overview of affective computing research in the era of large language models (LLMs).☆30Updated last year
- Implementation for the paper "Can Language Models Learn to Listen?"☆70Updated 2 years ago
- Narrative movie understanding benchmark☆76Updated 7 months ago
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆29Updated last year
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆108Updated 3 years ago
- HumanOmni☆216Updated 11 months ago
- [CVPR'23 Highlight] AutoAD: Movie Description in Context.☆103Updated last year
- Video dataset dedicated to portrait-mode video recognition.☆55Updated 3 months ago
- [ACM ICMR'25]Official repository for "eMotions: A Large-Scale Dataset for Emotion Recognition in Short Videos"☆37Updated 6 months ago
- ☆16Updated 5 years ago