Exploring-Embodied-Emotion-official / E3
☆14 Updated last month
Alternatives and similar repositories for E3
Users who are interested in E3 are comparing it to the repositories listed below
- [ACM MM 2022 Oral] This is the official implementation of "SER30K: A Large-Scale Dataset for Sticker Emotion Recognition" ☆24 Updated 2 years ago
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario… ☆54 Updated 11 months ago
- [CVPR 2024] This is the official implementation of "MART: Masked Affective RepresenTation Learning via Masked Temporal Distribution Disti… ☆18 Updated last month
- ☆35 Updated 10 months ago
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning ☆34 Updated 3 months ago
- [CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? ☆78 Updated 2 weeks ago
- [ICLR 2025] TRACE: Temporal Grounding Video LLM via Causal Event Modeling ☆110 Updated 3 weeks ago
- [ACM MM 2022] This is the official implementation of "Temporal Sentiment Localization: Listen and Look in Untrimmed Videos" ☆16 Updated 5 months ago
- HumanOmni ☆189 Updated 5 months ago
- GPT-4V with Emotion ☆93 Updated last year
- (ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos" ☆37 Updated last month
- ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback ☆72 Updated 10 months ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval. ☆41 Updated 9 months ago
- [ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners" ☆149 Updated 11 months ago
- Training A Small Emotional Vision Language Model for Visual Art Comprehension ☆16 Updated last year
- ☆31 Updated last year
- Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation" ☆27 Updated last year
- This is the official implementation of 2024 CVPR paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models". ☆86 Updated 6 months ago
- Narrative movie understanding benchmark ☆74 Updated 2 months ago
- Welcome to the official repository of Emotion-Qwen. ☆17 Updated 2 months ago
- LLMBind: A Unified Modality-Task Integration Framework ☆18 Updated last year
- [CVPR 2024] Official PyTorch implementation of the paper "One For All: Video Conversation is Feasible Without Video Instruction Tuning" ☆34 Updated last year
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences