facebookresearch / GliTrLinks
GliTr Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction
☆25Updated 2 years ago
Alternatives and similar repositories for GliTr
Users that are interested in GliTr are comparing it to the libraries listed below
Sorting:
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her…☆29Updated 2 years ago
- [CVPR 2023 Highlight] Beyond mAP: Towards better evaluation of instance segmentation☆27Updated 2 years ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆61Updated 2 years ago
- [NeurIPS 2022] Official PyTorch implementation of Optimizing Relevance Maps of Vision Transformers Improves Robustness. This code allows …☆132Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆102Updated 2 years ago
- A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei…☆83Updated 2 years ago
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆36Updated 2 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆78Updated 3 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- ☆36Updated 2 years ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆94Updated 3 years ago
- understanding model mistakes with human annotations☆106Updated 2 years ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated last year
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆96Updated last year
- Unofficial PyTorch implementation of TokenLearner by Google AI☆65Updated 2 years ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Updated 2 years ago
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆88Updated last year
- [CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings☆46Updated 2 years ago
- Code release for "Dropout Reduces Underfitting"☆315Updated 2 years ago
- Video descriptions of research papers relating to foundation models and scaling☆31Updated 2 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆33Updated 2 years ago
- An open source implementation of CLIP.☆33Updated 2 years ago
- ☆65Updated 2 years ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆57Updated last year
- A compilation of network architectures for vision and others without usage of self-attention mechanism☆80Updated 2 years ago
- Implementation of Multistream Transformers in Pytorch☆54Updated 4 years ago
- JAX implementation ViT-VQGAN☆82Updated 3 years ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Updated 4 years ago
- A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.☆49Updated last month
- codebase for the SIMAT dataset and evaluation☆38Updated 3 years ago