JuanFMontesinos / PyNVIdeoReader
GPU-accelerated video decoder
☆21Updated 3 years ago
Alternatives and similar repositories for PyNVIdeoReader:
Users that are interested in PyNVIdeoReader are comparing it to the libraries listed below
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆36Updated 2 years ago
- A library of transformer models for computer vision and multi-modality research☆49Updated 3 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆87Updated 7 months ago
- Implementations of Transformers for Video☆23Updated 3 years ago
- ☆54Updated 2 years ago
- ☆48Updated 2 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆110Updated 2 weeks ago
- We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances…☆47Updated 3 years ago
- [CVPR2023] Code for "Streaming Video Model"☆78Updated last year
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 3 years ago
- [arXiv 2020] Video Representation Learning with Visual Tempo Consistency☆24Updated 4 years ago
- ☆26Updated last year
- Video Contrastive Learning with Global Context, ICCVW 2021☆158Updated 2 years ago
- Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective, in ICCV 2021 (Oral)☆145Updated 3 years ago
- ☆17Updated 2 years ago
- A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.☆49Updated 8 months ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- Code repository for "It's About Time: Analog clock Reading in the Wild"☆73Updated 9 months ago
- ☆29Updated last year
- Official repository for the General Robust Image Task (GRIT) Benchmark☆51Updated last year
- Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 20…☆26Updated 3 years ago
- ☆44Updated 3 years ago
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆148Updated 2 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆99Updated 3 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆100Updated last year
- ☆54Updated 3 years ago
- ICCV DeeperAction Challenge - Kinetics-TPS Challenge on Part-level Action Parsing and Action Recognition.☆15Updated 3 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 3 years ago
- t-vMF Similarity for Regularizing Intra-Class Feature Distribution☆21Updated 3 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Updated last year