facebookresearch / GDT
We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances to transformations applied to both the audio and video streams.
☆46 · Updated 3 years ago
Alternatives and similar repositories for GDT:
Users who are interested in GDT are comparing it to the libraries listed below.
- RareAct: A video dataset of unusual interactions ☆32 · Updated 4 years ago
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021] ☆88 · Updated 3 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023) ☆33 · Updated 2 years ago
- [ICML 2021] “Self-Damaging Contrastive Learning”, Ziyu Jiang, Tianlong Chen, Bobak Mortazavi, Zhangyang Wang ☆63 · Updated 3 years ago
- Robust Contrastive Learning Using Negative Samples with Diminished Semantics (NeurIPS 2021) ☆38 · Updated 3 years ago
- Object-aware Contrastive Learning for Debiased Scene Representation (NeurIPS 2021) ☆45 · Updated 3 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images ☆58 · Updated 3 years ago
- This is an official PyTorch/GPU implementation of SupMAE. ☆77 · Updated 2 years ago
- Code for our paper "Auxiliary Task Reweighting for Minimum-data Learning" (NeurIPS 2020) ☆18 · Updated 4 years ago
- This repository contains the code for our CVPR 2022 paper on "Integrating Language Guidance into Vision-based Deep Metric Learning". ☆43 · Updated 2 years ago
- Code for SelfAugment ☆27 · Updated 4 years ago
- A PyTorch re-implementation of the paper 'Exploring Simple Siamese Representation Learning'. Reproduced the 67.8% Top1 Acc on ImageNet. ☆78 · Updated 3 years ago
- MoCo with Alignment and Uniformity Loss. ☆61 · Updated 3 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering" ☆18 · Updated 2 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020. ☆48 · Updated 3 years ago
- This is an official implementation of GRIT-VLP ☆21 · Updated 2 years ago
- Compressing Representations for Self-Supervised Learning ☆78 · Updated 4 years ago
- A PyTorch implementation of MoCo V3 ☆32 · Updated 3 years ago
- [ECCV2022] New benchmark for evaluating pre-trained models; new supervised contrastive learning framework. ☆107 · Updated last year
- ReSSL: Relational Self-Supervised Learning with Weak Augmentation ☆57 · Updated 3 years ago
- ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations wh… ☆24 · Updated 3 years ago
- A library of transformer models for computer vision and multi-modality research ☆49 · Updated 3 years ago
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant ☆23 · Updated last year
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning. ☆23 · Updated 3 years ago