facebookresearch / GDT
We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances to transformations applied to both the audio and video streams.
☆44Updated 3 years ago
Related projects: ⓘ
- ☆74Updated 2 years ago
- ☆31Updated 2 years ago
- ☆42Updated 3 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- Code for SelfAugment☆27Updated 3 years ago
- Robust Contrastive Learning Using Negative Samples with Diminished Semantics (NeurIPS 2021)☆39Updated 2 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 2 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆18Updated last year
- a pytorch implementation for MoCo V3☆31Updated 3 years ago
- ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations wh…☆25Updated 2 years ago
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]☆86Updated 2 years ago
- ☆31Updated this week
- This is a offical PyTorch/GPU implementation of SupMAE.☆76Updated 2 years ago
- ☆44Updated 2 years ago
- This repository contains the code for our CVPR 2022 paper on "Integrating Language Guidance into Vision-based Deep Metric Learning".☆43Updated 2 years ago
- Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)☆33Updated 2 years ago
- [ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and …☆59Updated 2 years ago
- Rethinking Nearest Neighbors for Visual Classification☆31Updated 2 years ago
- Shared Attention for Multi-label Zero-shot Learning accepted @ CVPR20☆30Updated 2 years ago
- Implementation of our PR 2020 paper:Unsupervised Text-to-Image Synthesis☆13Updated 4 years ago
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.☆23Updated 2 years ago
- Research code for "Training Vision-Language Transformers from Captions Alone"☆34Updated 2 years ago
- ☆22Updated last month
- [ECCV 2022] Official pytorch implementation of "mc-BEiT: Multi-choice Discretization for Image BERT Pre-training" in European Conference …☆22Updated 2 years ago
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant☆23Updated last year
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …☆64Updated 2 years ago
- UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning☆53Updated 3 years ago
- [ICML 2021] “ Self-Damaging Contrastive Learning”, Ziyu Jiang, Tianlong Chen, Bobak Mortazavi, Zhangyang Wang☆63Updated 2 years ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆105Updated 9 months ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆32Updated last year