JuanFMontesinos / PyNVIdeoReaderLinks
GPU-accelerated video decoder
☆20Updated 4 years ago
Alternatives and similar repositories for PyNVIdeoReader
Users that are interested in PyNVIdeoReader are comparing it to the libraries listed below
Sorting:
- ☆72Updated 2 years ago
- [CVPR2023] Code for "Streaming Video Model"☆78Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆102Updated 2 years ago
- Code for the Video Similarity Challenge.☆80Updated last year
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆36Updated 3 years ago
- ☆57Updated 3 years ago
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆151Updated 2 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆91Updated 7 months ago
- ☆180Updated 3 years ago
- Datasets, transforms and samplers for video in PyTorch☆88Updated 2 years ago
- A library of transformer models for computer vision and multi-modality research☆49Updated 4 years ago
- Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers☆232Updated 3 years ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Updated 4 years ago
- ☆69Updated 3 years ago
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆135Updated 4 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆93Updated last year
- ViT trained on COYO-Labeled-300M dataset☆33Updated 2 years ago
- CLIP-It! Language-Guided Video Summarization☆75Updated 4 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆102Updated 3 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021☆161Updated 3 years ago
- ☆31Updated 4 years ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆110Updated last year
- Use CLIP to represent video for Retrieval Task☆70Updated 4 years ago
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.☆182Updated 3 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmark☆54Updated 2 years ago
- ☆110Updated 2 years ago
- Code repository for "It's About Time: Analog clock Reading in the Wild"☆79Updated last year
- Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies☆50Updated 3 years ago
- ☆47Updated 3 years ago
- PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529☆164Updated 3 years ago