JuanFMontesinos / PyNVIdeoReaderLinks
GPU-accelerated video decoder
☆20Updated 4 years ago
Alternatives and similar repositories for PyNVIdeoReader
Users that are interested in PyNVIdeoReader are comparing it to the libraries listed below
Sorting:
- ☆71Updated 2 years ago
- Code for the Video Similarity Challenge.☆80Updated 2 years ago
- ☆69Updated 3 years ago
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆153Updated 3 years ago
- ☆180Updated 3 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆92Updated 9 months ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆104Updated 2 years ago
- [CVPR2023] Code for "Streaming Video Model"☆79Updated 2 years ago
- ☆58Updated 2 months ago
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆36Updated 3 years ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆110Updated 2 years ago
- ☆110Updated 3 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆94Updated last year
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆141Updated last month
- A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.☆49Updated 5 months ago
- [BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"☆55Updated 3 years ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Updated 4 years ago
- Visualizing the learned space-time attention using Attention Rollout☆40Updated 3 years ago
- VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa…☆78Updated 3 years ago
- ☆87Updated last year
- Video Contrastive Learning with Global Context, ICCVW 2021☆162Updated 3 years ago
- PyTorch code for MUST☆108Updated 9 months ago
- A library of transformer models for computer vision and multi-modality research☆49Updated 4 years ago
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.☆182Updated 3 years ago
- ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆97Updated 3 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmark☆54Updated 2 years ago
- Code repository for "It's About Time: Analog clock Reading in the Wild"☆79Updated last year
- ViT trained on COYO-Labeled-300M dataset☆33Updated 3 years ago
- PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)☆99Updated 3 years ago
- Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers☆232Updated 3 years ago