JuanFMontesinos / PyNVIdeoReader
GPU-accelerated video decoder
☆20Updated 3 years ago
Alternatives and similar repositories for PyNVIdeoReader:
Users that are interested in PyNVIdeoReader are comparing it to the libraries listed below
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆36Updated 2 years ago
- Video action classification benchmark for common CNN architectures, implemented in PyTorch☆11Updated 3 years ago
- Code for the Video Similarity Challenge.☆77Updated 11 months ago
- [CVPR2023] Code for "Streaming Video Model"☆78Updated last year
- VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa…☆76Updated 2 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmark☆51Updated last year
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- ☆31Updated 3 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆81Updated 6 months ago
- ☆17Updated 9 months ago
- Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies☆51Updated 2 years ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim)☆87Updated 3 years ago
- A library of transformer models for computer vision and multi-modality research☆49Updated 3 years ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆59Updated last year
- ☆69Updated last year
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".☆108Updated last year
- ☆17Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆98Updated last year
- Implementations of Transformers for Video☆23Updated 3 years ago
- Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective, in ICCV 2021 (Oral)☆144Updated 3 years ago
- ☆105Updated 2 years ago
- (NeurIPS 2021) Pytorch implementation of paper "Re-ranking for image retrieval and transductive few-shot classification"☆31Updated 3 years ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆33Updated 2 years ago
- ☆47Updated 7 months ago
- Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.☆50Updated 2 years ago
- ☆54Updated 2 years ago
- [arXiv 2020] Video Representation Learning with Visual Tempo Consistency☆24Updated 4 years ago
- LAEO-Net++☆20Updated 3 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Updated 2 years ago
- ☆171Updated 2 years ago