NMS05 / Audio-Visual-Deception-Detection-DOLOS-Dataset-and-Parameter-Efficient-Crossmodal-LearningView external linksLinks
☆29May 8, 2024Updated last year
Alternatives and similar repositories for Audio-Visual-Deception-Detection-DOLOS-Dataset-and-Parameter-Efficient-Crossmodal-Learning
Users that are interested in Audio-Visual-Deception-Detection-DOLOS-Dataset-and-Parameter-Efficient-Crossmodal-Learning are comparing it to the libraries listed below
Sorting:
- ☆16Jan 23, 2026Updated 3 weeks ago
- This is a project on visual spatial reasoning tasks-SIBench☆25Jan 12, 2026Updated last month
- Vision Transformer (ViT) models, with their attention mechanisms, revolutionized computer vision. By merging Class Activation Map (CAM) a…☆13Aug 14, 2023Updated 2 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 5, 2026Updated last week
- [ICCV 2025 DeepID Challenge] Official 1st Place in both tracks (Detection & Localization)☆17Dec 24, 2025Updated last month
- ☆10May 16, 2023Updated 2 years ago
- Self-similarity Prior Distillation for Unsupervised Remote Physiological Measurement☆10Oct 18, 2024Updated last year
- ☆10Jan 13, 2026Updated last month
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated last year
- Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)☆19Jan 17, 2026Updated 3 weeks ago
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…☆48Dec 20, 2024Updated last year
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated last year
- ☆10Nov 4, 2024Updated last year
- The dataset and codes of the paper UniMod1K: Towards a More Universal Large-Scale Dataset and Benchmark for Multi-Modal Learning.☆16Sep 21, 2025Updated 4 months ago
- This is the code corresponding to the paper "Resolve Domain Conflicts for Generalizable Remote Physiological Measurement." accepted in AC…☆15Apr 15, 2024Updated last year
- Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution"☆12Apr 14, 2023Updated 2 years ago
- Code for paper Audio Visual Speaker Localization from EgoCentric Views☆11Jul 3, 2024Updated last year
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- ☆20Nov 21, 2025Updated 2 months ago
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation☆23Dec 30, 2025Updated last month
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- ☆12Aug 8, 2024Updated last year
- Official implementation of "NoiseAR: AutoRegressing Initial Noise Prior for Diffusion Models"☆18Jun 3, 2025Updated 8 months ago
- The official implementation of NeurlPS 2025 D&B paper: IndustryEQA: Pushing the frontiers of Embodied Question Answering in Industrial Sc…☆12Sep 25, 2025Updated 4 months ago
- ☆13Jul 28, 2024Updated last year
- ☆12Mar 5, 2024Updated last year
- ☆14May 20, 2025Updated 8 months ago
- ☆13May 21, 2024Updated last year
- The official repository for "SurgNet: Self-supervised Pretraining with Semantic Consistency for Vessel and Instrument Segmentation in Sur…☆14Dec 30, 2024Updated last year
- rPPG; domain generalization; domain-label-free approach; NEuron STructure modeling (NEST);agnostic domain generalization.☆48Feb 2, 2024Updated 2 years ago
- AutoGesture with Temporal Difference Convolutions (TIP'21)☆53May 27, 2021Updated 4 years ago
- ☆13Oct 9, 2024Updated last year
- ☆15Jan 22, 2024Updated 2 years ago
- Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning (CVPR 2025, pytorch co…☆14Sep 29, 2025Updated 4 months ago
- ☆25Sep 18, 2025Updated 4 months ago
- ☆11May 7, 2022Updated 3 years ago
- The official PyTorch implementation of "MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion" in ECCV 2024.☆18Jul 6, 2025Updated 7 months ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Jun 1, 2023Updated 2 years ago
- Code for "ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation" (NeurIPS 23)☆14Apr 12, 2024Updated last year