qywu / FaceChatLinks
☆15Updated 2 years ago
Alternatives and similar repositories for FaceChat
Users that are interested in FaceChat are comparing it to the libraries listed below
Sorting:
- Multimodal Empathetic Chatbot☆53Updated last year
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Updated 11 months ago
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆19Updated 7 months ago
- GPT-4V with Emotion☆97Updated 2 years ago
- A comprehensive overview of affective computing research in the era of large language models (LLMs).☆29Updated last year
- An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Updated 2 years ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆15Updated last year
- [ACM MM 2022 Oral] This is the official implementation of "SER30K: A Large-Scale Dataset for Sticker Emotion Recognition"☆29Updated 3 years ago
- ☆20Updated 6 months ago
- [ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models☆104Updated 7 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆84Updated last year
- ☆66Updated 2 years ago
- A curated list of the papers, repositories, tutorials, and anythings related to the large language models for tools☆68Updated 2 years ago
- Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…☆124Updated 7 months ago
- [TMLR23] Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.☆232Updated 2 years ago
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23…☆58Updated last year
- [NAACL Findings 2024] PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits☆66Updated last year
- KokoMind: Can LLMs Understand Social Interactions?☆104Updated 2 years ago
- ☆75Updated last year
- Official code for the paper: InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews (previo…☆88Updated 7 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆47Updated 10 months ago
- Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation☆144Updated last year
- A project for tri-modal LLM benchmarking and instruction tuning.☆53Updated 9 months ago
- 🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)☆64Updated 2 years ago
- Code and Dataset for the paper "LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming" ACL …☆38Updated 2 years ago
- Video dataset dedicated to portrait-mode video recognition.☆55Updated 2 months ago
- ☆70Updated 7 months ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆74Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligence☆48Updated last year
- [ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model☆43Updated last year