guanxiongsun / STPNLinks
[ICCV2023] Spatio-temporal Prompting Network for Robust Video Feature Extraction
☆11Updated last year
Alternatives and similar repositories for STPN
Users that are interested in STPN are comparing it to the libraries listed below
Sorting:
- Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)☆30Updated 3 years ago
- PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆39Updated 2 years ago
- ☆49Updated 2 years ago
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction"☆70Updated last month
- Pytorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"☆60Updated 2 years ago
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆31Updated 2 years ago
- ☆35Updated last year
- A pytorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"☆57Updated 4 years ago
- CVPR 2021 VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild☆30Updated 2 years ago
- Self-supervised Point Cloud Representation Learning via Separating Mixed Shapes☆18Updated 2 years ago
- Code for <Domain Adaptive Video Segmentation via Temporal Consistency Regularization> in ICCV 2021☆42Updated 3 years ago
- The official github repo for "Test-Time Training with Masked Autoencoders"☆87Updated last year
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆18Updated 2 years ago
- ☆39Updated last year
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆50Updated 7 months ago
- Official codes for ConMIM (ICLR 2023)☆60Updated 2 years ago
- ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆97Updated 2 years ago
- ☆45Updated 2 years ago
- ☆43Updated 2 years ago
- Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M…☆42Updated 10 months ago
- code base for vision transformers☆36Updated 3 years ago
- Test-Time Training on Video Streams☆64Updated 2 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- ☆47Updated 3 years ago
- This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"☆19Updated 2 years ago
- Official Implementation of AlignMixup - CVPR 2022☆71Updated 3 years ago
- Accepted by CVPR 2022☆36Updated 3 years ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆79Updated last year
- Multi-Scale Spatio-Temporal Attention based Video Instance Segmentation☆40Updated 2 years ago
- Frame Flexible Network (CVPR2023)☆56Updated 2 years ago