JuanFMontesinos / PyNVIdeoReader
GPU-accelerated video decoder
☆19Updated 3 years ago
Related projects: ⓘ
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆37Updated 2 years ago
- Code for Temporal Data Augmentations (ECCVW 2020)☆35Updated 4 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆110Updated 5 months ago
- Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective, in ICCV 2021 (Oral)☆144Updated 2 years ago
- ICCV DeeperAction Challenge - Kinetics-TPS Challenge on Part-level Action Parsing and Action Recognition.☆15Updated 3 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 2 years ago
- ☆68Updated 11 months ago
- Implementations of Transformers for Video☆24Updated 3 years ago
- DeVIS: Making Deformable Transformers Work for Video Instance Segmentation☆39Updated last year
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆67Updated last year
- cuda implementation of depthwise conv3d☆21Updated 3 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021☆156Updated 2 years ago
- ☆34Updated 2 years ago
- Pytorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"☆56Updated last year
- Learning Representational Invariances for Data-Efficient Action Recognition☆32Updated 2 years ago
- ☆51Updated 2 years ago
- ☆33Updated 3 years ago
- ☆48Updated 2 years ago
- Video action classification benchmark for common CNN architectures, implemented in PyTorch☆11Updated 2 years ago
- MIST: Multiple Instance Spatial Transformer☆25Updated 3 years ago
- ☆52Updated last year
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆75Updated last month
- ☆43Updated 3 years ago
- ☆8Updated 2 years ago
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".☆107Updated last year
- a pytorch implementation for MoCo V3☆31Updated 3 years ago
- A library of transformer models for computer vision and multi-modality research☆49Updated 3 years ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim)☆86Updated 3 years ago
- Code for the Video Similarity Challenge.☆74Updated 7 months ago
- ☆17Updated 5 months ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆33Updated 2 years ago