exped1230 / S2-VER
The official implement of paper S2-VER: Semi-Supervised Visual Emotion Recognition
☆11Updated 11 months ago
Alternatives and similar repositories for S2-VER:
Users that are interested in S2-VER are comparing it to the libraries listed below
- ☆13Updated 8 months ago
- This is the official implemantation of “Learn-to-Decompose: Cascaded Decomposition Network for Cross-Domain Few-Shot Facial Expression Re…☆17Updated 2 years ago
- [ACM MM 2022] This is the official implementation of "Temporal Sentiment Localization: Listen and Look in Untrimmed Videos"☆16Updated last month
- [CVPR 2025] Interpreting Object-level Foundation Models via Visual Precision Search☆16Updated last week
- The official implementation of ECCV2024 paper "Facial Affective Behavior Analysis with Instruction Tuning"☆25Updated 2 months ago
- ☆7Updated 4 months ago
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆116Updated last year
- [ICCV 2023] Prompt-aligned Gradient for Prompt Tuning☆162Updated last year
- ☆36Updated 2 years ago
- [ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts☆12Updated 2 months ago
- [CVPR 2024] This is the official implementation of "MART: Masked Affective RepresenTation Learning via Masked Temporal Distribution Disti…☆16Updated 2 months ago
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆38Updated 2 years ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆50Updated 4 months ago
- [NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations☆132Updated 11 months ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆18Updated last month
- ☆52Updated last week
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆39Updated 5 months ago
- Training A Small Emotional Vision Language Model for Visual Art Comprehension☆16Updated 8 months ago
- [CVPR 2023 Highlight & TPAMI] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning☆115Updated 3 months ago
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…☆52Updated 6 months ago
- [ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers☆14Updated 11 months ago
- 🌀 R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)☆80Updated 8 months ago
- Official Repository for VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning☆22Updated 11 months ago
- ☆62Updated last year
- This is the official implementation of 2023 ICCV paper "EmoSet: A large-scale visual emotion dataset with rich attributes".☆48Updated last year
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning☆28Updated 6 months ago
- Benchmark data for "Rethinking Benchmarks for Cross-modal Image-text Retrieval" (SIGIR 2023)☆25Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Updated last year
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"☆17Updated 2 years ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆72Updated last year