atsiami / STAViS
Spatio-Temporal AudioVisual Saliency Network
☆55 Updated 2 years ago
Alternatives and similar repositories for STAViS
Users who are interested in STAViS are comparing it to the repositories listed below.
- ViNet: Pushing the limits of Visual Modality for Audio Visual Saliency Prediction ☆75 Updated 6 months ago
- Unified Image and Video Saliency Modeling (ECCV 2020) ☆152 Updated last year
- ☆36 Updated 3 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020) ☆130 Updated 4 years ago
- PyTorch code for the paper "Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization" ☆21 Updated 3 years ago
- Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection (ICCV 2019) ☆115 Updated 2 years ago
- [ICLR2021] official implementation of CT-Net ☆37 Updated 4 years ago
- ☆20 Updated 2 years ago
- [2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line ☆42 Updated 3 years ago
- Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion" ☆120 Updated 5 years ago
- PyTorch implementation of our T-PAMI 2021 paper: Self-supervised Video Representation Learning by Uncovering Motion and Appearance Stati… ☆50 Updated 5 years ago
- Unsupervised Film Genre Classification using Spatio-Temporal Contrastive Learning ☆32 Updated 2 years ago
- An unofficial PyTorch implementation of STSANet ☆11 Updated 2 years ago
- Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19) ☆145 Updated 3 years ago
- [Pattern Recognition] Video Saliency Prediction using Enhanced Spatiotemporal Alignment Network ☆25 Updated 4 years ago
- The official PyTorch implementation for paper "Hierarchical Domain-Adapted Feature Learning for Video Saliency Prediction" ☆27 Updated 2 years ago
- The official implementation of "Pyramid constrained self-attention network for fast video salient object detection" ☆57 Updated 4 years ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22) ☆108 Updated last year
- Localizing Visual Sounds the Hard Way ☆82 Updated 3 years ago
- Cross-Modal Self-Attention Network for Referring Image Segmentation (CVPR 2019) ☆56 Updated 6 years ago
- ☆49 Updated 5 years ago
- V4D: 4D Convolutional Neural Networks for Video-level Representation Learning ☆70 Updated 5 years ago
- ☆24 Updated 2 years ago
- ☆26 Updated 6 years ago
- ☆17 Updated 3 years ago
- ☆12 Updated 3 years ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021) ☆34 Updated 3 years ago
- Code for Discriminative Sounding Objects Localization (NeurIPS 2020) ☆59 Updated 4 years ago
- PyTorch implementation of AAAI 2021 paper: A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization ☆42 Updated 4 years ago
- [ECCV 2020] Boundary-Aware Cascade Networks for Temporal Action Segmentation ☆88 Updated 5 years ago