NMS05 / Audio-Visual-Deception-Detection-DOLOS-Dataset-and-Parameter-Efficient-Crossmodal-Learning
☆13Updated 4 months ago
Related projects: ⓘ
- Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"☆19Updated 4 months ago
- ☆30Updated 6 months ago
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆9Updated last week
- Benchmarking Joint Face Spoofing and Forgery Detection with Visual and Physiological Cues (TDSC'24)☆11Updated 8 months ago
- Official Repository for VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning☆17Updated 5 months ago
- [MM 2023] Toward High Quality Facial Representation Learning☆16Updated 10 months ago
- Official implementation of Faceptor: A Generalist Model for Face Perception.☆22Updated last month
- GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing☆21Updated 5 months ago
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)☆19Updated 2 months ago
- Official implementation for CIGN☆14Updated last year
- [ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation☆16Updated 2 years ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆11Updated 6 months ago
- ☆9Updated 9 months ago
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning☆34Updated 3 months ago
- [ACM MM 2023] QA-CLIMS: Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation☆11Updated 3 months ago
- ☆13Updated 7 months ago
- ☆39Updated last month
- [ECCV 2024🔥] The official code for the paper AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors.☆38Updated 2 months ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆14Updated last year
- ☆21Updated last year
- Disentangled Pre-training for Human-Object Interaction Detection☆10Updated 3 weeks ago
- Official implementation of "ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video" (ECCV2024)☆15Updated last month
- The official code implementation of Generalized Category Discovery in Semantic Segmentation☆11Updated 9 months ago
- Code for DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection☆44Updated last year
- ☆17Updated 5 months ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆24Updated 2 months ago
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…☆37Updated 2 weeks ago
- A Large-scale, Multi-modal, Compound Affective Database for Dynamic Facial Expression Recognition in the Wild.☆32Updated this week
- ☆16Updated 11 months ago
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024☆20Updated last month