atsiami / STAViS
Spatio-Temporal AudioVisual Saliency Network
☆53Updated last year
Alternatives and similar repositories for STAViS
Users who are interested in STAViS are comparing it to the repositories listed below
- ViNet Pushing the limits of Visual Modality for Audio Visual Saliency Prediction☆71Updated 3 months ago
- [2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line☆42Updated 3 years ago
- ☆20Updated 2 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)☆129Updated 4 years ago
- Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection (ICCV 2019)☆111Updated 2 years ago
- Unified Image and Video Saliency Modeling (ECCV 2020)☆146Updated last year
- Pytorch code for paper Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization☆21Updated 2 years ago
- ☆16Updated 3 years ago
- [ICLR2021] official implementation of CT-Net☆37Updated 3 years ago
- [Pattern Recognition] Video Saliency Prediction using Enhanced Spatiotemporal Alignment Network☆25Updated 4 years ago
- Localizing Visual Sounds the Hard Way☆82Updated 3 years ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)☆107Updated last year
- Unsupervised Film Genre Classification using Spatio-Temporal Contrastive Learning☆32Updated 2 years ago
- FingerRec / Self-Supervised-Temporal-Discriminative-Representation-Learning-for-Video-Action-Recognition: [Arxiv2020] The code for our paper 《Self-Supervised Temporal-Discriminative Representation Learning for Video Action Recognition》 https:/…☆76Updated 5 years ago
- Pytorch implementation of our T-PAMI 2021 paper: Self-supervised Video Representation Learning by Uncovering Motion and Appearance Stati…☆50Updated 4 years ago
- PyTorch implementation of AAAI 2021 paper: A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization☆42Updated 4 years ago
- Cross-Modal Self-Attention Network for Referring Image Segmentation (CVPR 2019)☆56Updated 6 years ago
- The official implementation of "Pyramid constrained self-attention network for fast video salient object detection"☆57Updated 3 years ago
- Panoramic audiovisual salient object segmentation☆30Updated 2 years ago
- Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"☆119Updated 5 years ago
- ☆36Updated 3 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'☆111Updated 4 years ago
- Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)☆142Updated 2 years ago
- V4D: 4D Convolutional Neural Networks for Video-level Representation Learning☆70Updated 5 years ago
- [CVPR 2021] 3D CNNs with Adaptive Temporal Feature Resolutions https://arxiv.org/abs/2011.08652☆26Updated 4 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆100Updated 4 years ago
- ☆12Updated 2 years ago
- The pytorch implementation of STSANet (non-official)☆11Updated 2 years ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆34Updated 3 years ago
- PyTorch 3DNet attention feature map visualization by [CAM](https://arxiv.org/abs/1512.04150); C3D, R3D, I3D, and MF Net are supported now!☆66Updated 5 years ago