IBM / sifar-pytorch
super image for action recognition
☆55Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for sifar-pytorch
- Code for Temporal Data Augmentations (ECCVW 2020)☆35Updated 4 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆36Updated last month
- Unsupervised Film Genre Classification using Spatio-Temporal Contrastive Learning☆31Updated last year
- Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.☆50Updated 2 years ago
- ☆69Updated last year
- Learning Representational Invariances for Data-Efficient Action Recognition☆32Updated 3 years ago
- Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)☆29Updated 3 years ago
- ☆108Updated 3 years ago
- Reducing spatial redundancy in video recognition. SOTA computational efficiency.☆122Updated 2 years ago
- [CVPR2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit…☆18Updated 2 years ago
- ☆34Updated 2 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆84Updated 2 years ago
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆18Updated 2 years ago
- ☆44Updated 3 years ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆33Updated 2 years ago
- Official Code Release for Container : Context Aggregation Network☆46Updated 3 years ago
- cuda implementation of depthwise conv3d☆21Updated 3 years ago
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o…☆39Updated last year
- ☆33Updated 3 years ago
- code base for vision transformers☆36Updated 2 years ago
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆40Updated last year
- ☆47Updated 2 years ago
- [CVPR2022] Unsupervised Pre-training for Temporal Action Localization Tasks (UP-TAL)☆29Updated 2 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- Code and models for the paper Glance-and-Gaze Vision Transformer☆28Updated 3 years ago
- Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022)☆45Updated last year
- A single stage temporal action detection toolbox based on PyTorch☆53Updated 2 years ago
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆50Updated last year
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- a pytorch implementation for MoCo V3☆32Updated 3 years ago