facebookresearch / GliTr
GliTr Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction
☆25Updated last year
Related projects ⓘ
Alternatives and complementary repositories for GliTr
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her…☆29Updated last year
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆37Updated last year
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆57Updated last year
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations"☆31Updated 11 months ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆80Updated 3 months ago
- A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei…☆77Updated last year
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".☆93Updated 5 months ago
- Timm model explorer☆36Updated 7 months ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.☆13Updated last year
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆37Updated last year
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆52Updated last month
- More dimensions = More fun☆21Updated 3 months ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆59Updated 10 months ago
- [CVPR'23 Highlight] Heterogeneous Continual Learning.☆15Updated 11 months ago
- SSL Video Representation Learning project☆10Updated 11 months ago
- ☆12Updated 2 months ago
- Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations☆26Updated last year
- ☆50Updated 5 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆34Updated last year
- A Contrastive Learning Boost from Intermediate Pre-Trained Representations☆36Updated last month
- Python Tools for Visual Dataset Transformation☆26Updated last week
- Official repository for the General Robust Image Task (GRIT) Benchmark☆50Updated last year
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆30Updated 2 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆51Updated last year
- [CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings☆44Updated last year
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆31Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆13Updated 3 months ago