JuanFMontesinos / PyNVIdeoReaderLinks
GPU-accelerated video decoder
☆20Updated 4 years ago
Alternatives and similar repositories for PyNVIdeoReader
Users that are interested in PyNVIdeoReader are comparing it to the libraries listed below
Sorting:
- [CVPR2023] Code for "Streaming Video Model"☆78Updated 2 years ago
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆150Updated 2 years ago
- ☆72Updated 2 years ago
- ☆56Updated 3 years ago
- Code for the Video Similarity Challenge.☆80Updated last year
- "Object-Region Video Transformers”, Herzig et al., CVPR 2022☆48Updated 3 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021☆159Updated 3 years ago
- ☆86Updated last year
- Official source code for "Continual 3D Convolutional Neural Networks for Real-time Processing of Videos" [ECCV2022]☆45Updated 2 years ago
- ☆178Updated 3 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆92Updated 5 months ago
- A library of transformer models for computer vision and multi-modality research☆49Updated 4 years ago
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆36Updated 3 years ago
- Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers☆231Updated 3 years ago
- ☆109Updated 2 years ago
- ☆68Updated 2 years ago
- TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]]☆56Updated 2 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆92Updated last year
- Visualizing the learned space-time attention using Attention Rollout☆36Updated 3 years ago
- Datasets, transforms and samplers for video in PyTorch☆88Updated 2 years ago
- PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529☆164Updated 3 years ago
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆106Updated last year
- ViT trained on COYO-Labeled-300M dataset☆32Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆102Updated 2 years ago
- ☆26Updated 2 years ago
- [ECCV 2020] Boundary-Aware Cascade Networks for Temporal Action Segmentation☆88Updated 4 years ago
- ☆32Updated 2 years ago
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation)☆73Updated 4 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- cuda implementation of depthwise conv3d☆22Updated 4 years ago