guanxiongsun / STPNLinks
[ICCV2023] Spatio-temporal Prompting Network for Robust Video Feature Extraction
☆11Updated last year
Alternatives and similar repositories for STPN
Users that are interested in STPN are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"☆60Updated 2 years ago
- PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆39Updated 2 years ago
- ☆48Updated last year
- ☆35Updated last year
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆34Updated 3 years ago
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆17Updated 2 years ago
- ☆39Updated last year
- Code for <Domain Adaptive Video Segmentation via Temporal Consistency Regularization> in ICCV 2021☆42Updated 3 years ago
- Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)☆30Updated 3 years ago
- Accepted by CVPR 2022☆36Updated 3 years ago
- code base for vision transformers☆36Updated 3 years ago
- Multi-Scale Spatio-Temporal Attention based Video Instance Segmentation☆40Updated 2 years ago
- ☆48Updated 2 years ago
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction"☆70Updated 2 weeks ago
- CVPR 2021 VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild☆30Updated 2 years ago
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆54Updated last year
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated last year
- Self-supervised Point Cloud Representation Learning via Separating Mixed Shapes☆18Updated 2 years ago
- Official Implementation of AlignMixup - CVPR 2022☆71Updated 3 years ago
- A pytorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"☆57Updated 4 years ago
- ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆97Updated 2 years ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)☆108Updated last year
- ☆15Updated 3 years ago
- This is the official released code for our paper, The Emergence of Objectness: Learning Zero-Shot Segmentation from Videos, which has bee…☆53Updated 2 years ago
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆36Updated 2 years ago
- Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M…☆42Updated 10 months ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- Learning Multiple Dense Prediction Tasks from Partially Annotated Data - CVPR 2022☆49Updated 2 years ago
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆38Updated last year