JuanFMontesinos / PyNVIdeoReader
GPU-accelerated video decoder
☆20 · Updated 3 years ago
Alternatives and similar repositories for PyNVIdeoReader:
Users who are interested in PyNVIdeoReader are comparing it to the repositories listed below:
- Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies ☆51 · Updated 2 years ago
- ☆34 · Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch ☆99 · Updated last year
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation" ☆59 · Updated last year
- The 1st place solution of 2022 Ego4d Natural Language Queries. ☆32 · Updated 2 years ago
- ☆52 · Updated last year
- ☆69 · Updated last year
- [CVPR2023] Code for "Streaming Video Model" ☆78 · Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection ☆56 · Updated 2 years ago
- Code for the Video Similarity Challenge. ☆77 · Updated last year
- ICCV DeeperAction Challenge - Kinetics-TPS Challenge on Part-level Action Parsing and Action Recognition. ☆15 · Updated 3 years ago
- HIRL: A General Framework for Hierarchical Image Representation Learning (http://arxiv.org/abs/2205.13159) ☆40 · Updated 2 years ago
- ☆48 · Updated 8 months ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de… ☆98 · Updated 2 years ago
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation) ☆72 · Updated 3 years ago
- This is an official PyTorch/GPU implementation of SupMAE. ☆77 · Updated 2 years ago
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https… ☆36 · Updated 2 years ago
- ☆8 · Updated 2 years ago
- ☆66 · Updated 2 years ago
- Code repository for "It's About Time: Analog clock Reading in the Wild" ☆73 · Updated 8 months ago
- [ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model ☆41 · Updated last month
- Official source code for "Continual 3D Convolutional Neural Networks for Real-time Processing of Videos" [ECCV2022] ☆42 · Updated 2 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition ☆33 · Updated 3 years ago
- ☆44 · Updated 3 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ☆64 · Updated 2 years ago
- ☆29 · Updated last year
- ☆34 · Updated 3 years ago
- ☆17 · Updated 10 months ago
- ☆31 · Updated 3 years ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021) ☆33 · Updated 2 years ago