JuanFMontesinos / PyNVIdeoReader
GPU-accelerated video decoder
☆21Updated 3 years ago
Alternatives and similar repositories for PyNVIdeoReader
Users that are interested in PyNVIdeoReader are comparing it to the libraries listed below
Sorting:
- ☆55Updated 2 years ago
- [CVPR2023] Code for "Streaming Video Model"☆78Updated last year
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆36Updated 2 years ago
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".☆110Updated last year
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆89Updated last month
- Implementations of Transformers for Video☆23Updated 4 years ago
- Code for the Video Similarity Challenge.☆78Updated last year
- ☆48Updated 3 years ago
- ☆108Updated 2 years ago
- ☆29Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection☆65Updated 2 years ago
- Video action classification benchmark for common CNN architectures, implemented in PyTorch☆11Updated 3 years ago
- ☆26Updated last year
- ☆73Updated 2 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.☆118Updated last year
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆148Updated 2 years ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Updated 3 years ago
- MIST: Multiple Instance Spatial Transformer☆25Updated 3 years ago
- ☆175Updated 2 years ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆108Updated last year
- [CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection☆18Updated 7 months ago
- Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies☆51Updated 2 years ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆58Updated last year
- ☆84Updated last year
- A library of transformer models for computer vision and multi-modality research☆49Updated 3 years ago
- ☆66Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆101Updated last year
- ViT trained on COYO-Labeled-300M dataset☆32Updated 2 years ago
- ☆31Updated 3 years ago