atsiami / STAViSLinks
Spatio-Temporal AudioVisual Saliency Network
☆52Updated last year
Alternatives and similar repositories for STAViS
Users that are interested in STAViS are comparing it to the libraries listed below
Sorting:
- ViNet Pushing the limits of Visual Modality for Audio Visual Saliency Prediction☆68Updated last month
- Pytorch implementation of our T-PAMI 2021 paper: Self-supervised Video Representation Learning by Uncovering Motion and Appearance Stati…☆50Updated 4 years ago
- Unsupervised Film Genre Classification using Spatio-Temporal Contrastive Learning☆32Updated 2 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)☆129Updated 4 years ago
- [2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line☆43Updated 3 years ago
- Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)☆142Updated 2 years ago
- Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection (ICCV 2019)☆111Updated 2 years ago
- Localizing Visual Sounds the Hard Way☆81Updated 3 years ago
- Unified Image and Video Saliency Modeling (ECCV 2020)☆146Updated last year
- Pytorch code for paper Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization☆21Updated 2 years ago
- Python script for downloading Kinetics datasets (Kinetics400, Kinetics600, Kinetics700)☆19Updated 5 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆90Updated 2 years ago
- [Pattern Recognition]Video Saliency Prediction using Enhanced Spatiotemporal Alignment Network☆25Updated 4 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'☆111Updated 4 years ago
- PyTorch implementation of AAAI 2021 paper: A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization☆42Updated 4 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆100Updated 4 years ago
- FingerRec / Self-Supervised-Temporal-Discriminative-Representation-Learning-for-Video-Action-Recognition[Arxiv2020] The code for our paper 《Self-Supervised Temporal-Discriminative Representation Learning for Video Action Recognition》 https:/…☆76Updated 4 years ago
- ☆20Updated 2 years ago
- ☆26Updated 5 years ago
- Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"☆120Updated 5 years ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)☆107Updated last year
- Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch☆113Updated 4 years ago
- 🏆 The 2nd Place Submission to the CVPR2021-Evoked Emotion from Videos challenge.☆17Updated 4 years ago
- The official implementation of "Pyramid constrained self-attention network for fast video salient object detection"☆57Updated 3 years ago
- Implementations of Transformers for Video☆24Updated 4 years ago
- ☆15Updated 2 years ago
- V4D: 4D Convolutional Neural Networks for Video-level Representation Learning☆69Updated 4 years ago
- ☆36Updated 3 years ago
- Cross-Modal Self-Attention Network for Referring Image Segmentation cvpr19☆56Updated 5 years ago
- ☆10Updated 2 years ago