QinYang12 / SVGC-AVA
☆13Updated 9 months ago
Alternatives and similar repositories for SVGC-AVA
Users who are interested in SVGC-AVA are comparing it to the repositories listed below:
- Audio-Visual Perception of Omnidirectional Video for Virtual Reality Applications☆13Updated 2 years ago
- Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio☆17Updated 3 years ago
- Official PyTorch implementation of our paper "Spherical Vision Transformer for 360° Video Saliency Prediction" (BMVC 2023)☆16Updated last year
- SST-Sal: A spherical spatio-temporal approach for saliency prediction in 360º videos☆13Updated last year
- 360 video Head and Eye movement prediction framework with two-stream models☆31Updated 3 years ago
- Transformer-based Long-Term Viewport Prediction in 360° Video: Scanpath is All You Need☆10Updated 3 years ago
- Official implementation of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆22Updated last year
- Saliency prediction on 360° images with SalGAN☆16Updated 4 years ago
- Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".☆31Updated 10 months ago
- Official repository of Panoramic Vision Transformer for Saliency Detection in 360° Videos (ECCV 2022)☆35Updated 2 years ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images☆20Updated 2 years ago
- Awesome works and resources relevant to 360 processing.☆57Updated last year
- The improved version of our previous work SalGAN360, which predicts visual saliency on 360° images☆14Updated 4 years ago
- Dataset and source code for "CEAP-360VR: A Continuous Physiological and Behavioral Emotion Annotation Dataset for 360° Videos" in IEEE Tr…☆25Updated last year
- Pytorch implementation of "ST360IQ: NO-REFERENCE OMNIDIRECTIONAL IMAGE QUALITY ASSESSMENT WITH SPHERICAL VISION TRANSFORMERS"☆13Updated 2 years ago
- Code and model for the paper "ScanGAN360: A Generative Model of Realistic Scanpaths for 360º Images"☆28Updated 2 years ago
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Updated 5 years ago
- SoundNet and sound source localization☆11Updated 4 years ago
- The pytorch implementation of STSANet (non-official)☆11Updated 2 years ago
- Code and models for "Panoramic convolutions for 360º single-image saliency prediction"☆40Updated 4 years ago
- ☆39Updated 7 years ago
- Spatio-Temporal AudioVisual Saliency Network☆51Updated last year
- Visual saliency estimation for 360° images using stacked autoencoder.☆14Updated 8 years ago
- ☆10Updated 2 years ago
- Panoramic audiovisual salient object segmentation☆30Updated last year
- Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024☆12Updated last year
- TranSalNet: Towards perceptually relevant visual saliency prediction. Neurocomputing (2022)☆57Updated 10 months ago
- Official code and dataset of MVFormer☆9Updated last year
- ☆22Updated 2 years ago
- Source code to generate 360-degree saliency☆24Updated 3 years ago