JuanFMontesinos / PyNVIdeoReader
GPU-accelerated video decoder
☆20 Updated 4 years ago
Alternatives and similar repositories for PyNVIdeoReader
Users who are interested in PyNVIdeoReader compare it to the repositories listed below.
- ☆71 Updated 2 years ago
- A library of transformer models for computer vision and multi-modality research ☆49 Updated 4 years ago
- Code release for "MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition", CVPR 2022 ☆152 Updated 3 years ago
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification ☆135 Updated 4 years ago
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https… ☆36 Updated 3 years ago
- Code + pre-trained models for the paper "Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers" ☆232 Updated 3 years ago
- ☆69 Updated 3 years ago
- Code for the Video Similarity Challenge. ☆81 Updated last year
- Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper) ☆223 Updated 3 years ago
- Datasets, transforms and samplers for video in PyTorch ☆88 Updated 2 years ago
- ☆180 Updated 3 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021 ☆161 Updated 3 years ago
- Official source code for "Continual 3D Convolutional Neural Networks for Real-time Processing of Videos" [ECCV2022] ☆45 Updated 3 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning" ☆93 Updated last year
- [CVPR2023] Code for "Streaming Video Model" ☆79 Updated 2 years ago
- ☆58 Updated 3 weeks ago
- ☆47 Updated 3 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496 ☆91 Updated 8 months ago
- "Object-Region Video Transformers", Herzig et al., CVPR 2022 ☆48 Updated 3 years ago
- ☆86 Updated last year
- ☆54 Updated 4 years ago
- PyTorch implementation of X3D models with Multigrid training. ☆100 Updated 4 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries ☆56 Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in PyTorch ☆103 Updated 2 years ago
- A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short. ☆49 Updated 4 months ago
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation) ☆73 Updated 4 years ago
- Authors' official PyTorch implementation of the "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 20… ☆69 Updated 2 years ago
- Code repository for "It's About Time: Analog Clock Reading in the Wild" ☆79 Updated last year
- Visualizing the learned space-time attention using Attention Rollout ☆40 Updated 3 years ago
- Repo for the Video Person Clustering dataset, and code for the associated paper ☆54 Updated 3 years ago