facebookresearch / GDT
We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances to transformations applied to both the audio and video streams.
☆47Updated 3 years ago
Alternatives and similar repositories for GDT:
Users that are interested in GDT are comparing it to the libraries listed below
- ☆73Updated 2 years ago
- ☆31Updated 3 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- MoCo with Alignment and Uniformity Loss.☆62Updated 3 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆18Updated 2 years ago
- Robust Contrastive Learning Using Negative Samples with Diminished Semantics (NeurIPS 2021)☆38Updated 3 years ago
- ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations wh…☆24Updated 3 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 4 years ago
- This repository contains the code for our CVPR 2022 paper on "Integrating Language Guidance into Vision-based Deep Metric Learning".☆43Updated 2 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 3 years ago
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]☆88Updated 3 years ago
- Object-aware Contrastive Learning for Debiased Scene Representation (NeurIPS 2021)☆45Updated 3 years ago
- ☆84Updated 3 years ago
- a pytorch implementation for MoCo V3☆32Updated 3 years ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆99Updated 2 years ago
- This repository hosts the dataset and source code for "A causal view of compositional zero-shot recognition". Yuval Atzmon, Felix Kreuk, …☆27Updated 3 years ago
- ☆42Updated 4 years ago
- Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)☆33Updated 2 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆46Updated last year
- A library of transformer models for computer vision and multi-modality research☆49Updated 3 years ago
- [ICML 2021] “ Self-Damaging Contrastive Learning”, Ziyu Jiang, Tianlong Chen, Bobak Mortazavi, Zhangyang Wang☆63Updated 3 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆33Updated 2 years ago
- models and tools for -What makes ImageNet good for Transfer Learning?☆25Updated 7 years ago
- Research code for "Training Vision-Language Transformers from Captions Alone"☆34Updated 2 years ago
- This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆41Updated 2 years ago
- (ICML 2021) Implementation for S2SD - Simultaneous Similarity-based Self-Distillation for Deep Metric Learning. Paper Link: https://arxiv…☆42Updated 4 years ago
- [ECCV 2022] Official pytorch implementation of "mc-BEiT: Multi-choice Discretization for Image BERT Pre-training" in European Conference …☆22Updated 2 years ago
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant☆23Updated last year
- Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022)☆46Updated last year