qywu / FaceChatLinks
☆15Updated 2 years ago
Alternatives and similar repositories for FaceChat
Users that are interested in FaceChat are comparing it to the libraries listed below
Sorting:
- Multimodal Empathetic Chatbot☆42Updated last year
- GPT-4V with Emotion☆93Updated last year
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23…☆56Updated 9 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆82Updated last year
- Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…☆123Updated 2 months ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆14Updated 9 months ago
- Implementation for the paper "Can Language Models Learn to Listen?"☆65Updated last year
- A project for tri-modal LLM benchmarking and instruction tuning.☆42Updated 4 months ago
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆29Updated last year
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Updated 6 months ago
- Video dataset dedicated to portrait-mode video recognition.☆52Updated 8 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆48Updated last year
- A curated list of the papers, repositories, tutorials, and anythings related to the large language models for tools☆68Updated last year
- ☆66Updated 2 years ago
- [ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models☆87Updated 2 months ago
- A comprehensive overview of affective computing research in the era of large language models (LLMs).☆25Updated last year
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆46Updated 5 months ago
- Multimodal-Procedural-Planning☆92Updated 2 years ago
- ☆73Updated last year
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆74Updated 8 months ago
- Official repo for StableLLAVA☆95Updated last year
- An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Updated last year
- [TMLR23] Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.☆228Updated last year
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆81Updated 6 months ago
- Code and Dataset for the paper "LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming" ACL …☆36Updated last year
- Narrative movie understanding benchmark☆74Updated 2 months ago
- Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"☆130Updated 2 weeks ago
- ☆70Updated 2 months ago
- Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation☆141Updated 9 months ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Updated last year