QinYang12 / SVGC-AVALinks
☆14Updated 10 months ago
Alternatives and similar repositories for SVGC-AVA
Users that are interested in SVGC-AVA are comparing it to the libraries listed below
Sorting:
- SST-Sal: A spherical spatio-temporal approach for saliency prediction in 360º videos☆13Updated last year
- Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio☆17Updated 3 years ago
- Official PyTorch implementation of our paper "Spherical Vision Transformer for 360° Video Saliency Prediction" (BMVC 2023)☆16Updated last year
- Audio-Visual Perception of Omnidirectional Video for Virtual Reality Applications☆13Updated 2 years ago
- The improved version of our previous work SalGAN360 which predict visual saliency on 360° image☆15Updated 4 years ago
- 360 video Head and Eye movement prediction framework with two-stream models☆31Updated 3 years ago
- Saliency prediction on 360° image with SalGAN☆16Updated 4 years ago
- Awesome works and resources are relevant to 360 processing.☆57Updated last year
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images☆20Updated 2 years ago
- Official repository of Panoramic Vision Transformer for Saliency Detection in 360° Videos (ECCV 2022)☆35Updated 2 years ago
- soundnet and localize sound source☆11Updated 4 years ago
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆22Updated last year
- Visual saliency estimation for 360° images using stacked autoencoder.☆14Updated 8 years ago
- Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".☆32Updated 10 months ago
- Pytorch implementation of "ST360IQ: NO-REFERENCE OMNIDIRECTIONAL IMAGE QUALITY ASSESSMENT WITH SPHERICAL VISION TRANSFORMERS"☆13Updated 2 years ago
- Transformer-based Long-Term Viewport Prediction in 360° Video: Scanpath is All You Need☆10Updated 3 years ago
- ☆39Updated 7 years ago
- Repository for implementation of SalNet360 in Caffe☆18Updated 6 years ago
- Panoramic audiovisual salient object segmentation☆30Updated last year
- Spatio-Temporal AudioVisual Saliency Network☆52Updated last year
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Updated 5 years ago
- Code and model for the paper "ScanGAN360: A Generative Model of Realistic Scanpaths for 360º Images"☆28Updated 2 years ago
- ☆10Updated 2 years ago
- Dataset and source code for "CEAP-360VR: A Continuous Physiological and Behavioral Emotion Annotation Dataset for 360° Videos" in IEEE Tr…☆25Updated last year
- Code and models for "Panoramic convolutions for 360º single-image saliency prediction"☆40Updated 4 years ago
- Official code and dataset of MVFormer☆8Updated last year
- Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024☆14Updated last year
- ViNet Pushing the limits of Visual Modality for Audio Visual Saliency Prediction☆68Updated 2 years ago
- ☆29Updated 3 years ago
- CP-360-Weakly-Supervised-Saliency☆30Updated 6 years ago