QinYang12 / SVGC-AVA
☆14Updated 11 months ago
Alternatives and similar repositories for SVGC-AVA
Users that are interested in SVGC-AVA are comparing it to the libraries listed below
- SST-Sal: A spherical spatio-temporal approach for saliency prediction in 360º videos☆13Updated last year
- Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio☆17Updated 3 years ago
- 360 video Head and Eye movement prediction framework with two-stream models☆31Updated 3 years ago
- An improved version of our previous work SalGAN360, which predicts visual saliency on 360° images☆15Updated 4 years ago
- Official repository of Panoramic Vision Transformer for Saliency Detection in 360° Videos (ECCV 2022)☆35Updated 2 years ago
- soundnet and localize sound source☆11Updated 4 years ago
- Awesome works and resources relevant to 360 processing.☆57Updated last year
- Dataset and source code for "CEAP-360VR: A Continuous Physiological and Behavioral Emotion Annotation Dataset for 360° Videos" in IEEE Tr…☆25Updated 2 years ago
- Saliency prediction on 360° image with SalGAN☆16Updated 4 years ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images☆21Updated 2 years ago
- Transformer-based Long-Term Viewport Prediction in 360° Video: Scanpath is All You Need☆10Updated 3 years ago
- ☆39Updated 7 years ago
- Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".☆32Updated 11 months ago
- Code and model for the paper "ScanGAN360: A Generative Model of Realistic Scanpaths for 360º Images"☆29Updated 2 years ago
- Visual saliency estimation for 360° images using stacked autoencoder.☆14Updated 8 years ago
- Official implementation of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆22Updated last year
- ViNet: Pushing the limits of Visual Modality for Audio-Visual Saliency Prediction☆68Updated 2 years ago
- A PyTorch implementation of STSANet (non-official)☆11Updated 2 years ago
- Repository for implementation of SalNet360 in Caffe☆18Updated 7 years ago
- ☆22Updated 2 years ago
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Updated 6 years ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆14Updated 4 months ago
- Spatio-Temporal AudioVisual Saliency Network☆52Updated last year
- [2024 ECCV] Label-anticipated Event Disentanglement for Audio-Visual Video Parsing☆11Updated 8 months ago
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing☆13Updated 2 years ago
- A curated list of audio-visual learning methods and datasets.☆263Updated 7 months ago
- Spatial Audio Generation☆111Updated 2 years ago
- A Multi-channel CNN for Blind 360-Degree Image Quality Assessment☆26Updated 2 years ago
- [CVPR 2024] Official PyTorch implementation for the paper: Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Archi…☆21Updated 3 months ago
- ☆29Updated 3 years ago