facebookresearch / GliTr
GliTr Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction
☆25Updated 2 years ago
Alternatives and similar repositories for GliTr:
Users that are interested in GliTr are comparing it to the libraries listed below
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆60Updated last year
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her…☆30Updated 2 years ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆58Updated last year
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆36Updated last year
- [CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings☆46Updated last year
- Timm model explorer☆39Updated last year
- A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.☆48Updated 10 months ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆54Updated 2 years ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆102Updated 10 months ago
- Implementation for the CVPR 2023 paper "Improving Selective Visual Question Answering by Learning from Your Peers" (https://arxiv.org/abs…☆24Updated last year
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆89Updated last week
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated last year
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆51Updated last year
- A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei…☆79Updated last year
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects".☆110Updated last year
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- Python Tools for Visual Dataset Transformation☆26Updated last week
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated 7 months ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- FID computation in Jax/Flax.☆27Updated 9 months ago
- Object-Region Video Transformers☆24Updated 3 years ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆32Updated last year
- PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411☆113Updated last year
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆50Updated 2 years ago
- Video descriptions of research papers relating to foundation models and scaling☆30Updated 2 years ago
- ☆51Updated 10 months ago
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation☆41Updated 10 months ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆100Updated last year
- Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations☆28Updated last year
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago