amazon-science / gluonmm
A library of transformer models for computer vision and multi-modality research
☆49 Updated 3 years ago
Alternatives and similar repositories for gluonmm
Users who are interested in gluonmm are comparing it to the libraries listed below:
- Video Noise Contrastive Estimation ☆66 Updated last year
- Implementation of momentum^2 teacher ☆121 Updated 4 years ago
- [arXiv 2020] Video Representation Learning with Visual Tempo Consistency ☆24 Updated 4 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020. ☆48 Updated 4 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition ☆33 Updated 3 years ago
- ☆31 Updated 3 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction ☆99 Updated 3 years ago
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet … ☆60 Updated 4 years ago
- Pytorch code for Towards Backward-Compatible Representation Learning [CVPR 2020 Oral] ☆54 Updated 3 years ago
- ☆34 Updated 2 years ago
- ☆44 Updated 3 years ago
- A ShuffleBatchNorm layer to shuffle BatchNorm statistics across multiple GPUs ☆56 Updated 3 years ago
- PIC Challenge Baseline ☆19 Updated 6 years ago
- MIST: Multiple Instance Spatial Transformer ☆25 Updated 3 years ago
- 1st Place Solution to ECCV-TAO-2020: Detect and Represent Any Object for Tracking ☆80 Updated 4 years ago
- SoT: Delving Deeper into Classification Head for Transformer ☆48 Updated 3 years ago
- ☆73 Updated 2 years ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim) ☆89 Updated 3 years ago
- Search Losses of our paper 'Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search', accepted by ICLR 2021. ☆56 Updated 3 years ago
- ☆40 Updated 2 years ago
- When can you tell whether an image has been cropped or not? ☆29 Updated 3 years ago
- RareAct: A video dataset of unusual interactions ☆32 Updated 4 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021) ☆33 Updated 3 years ago
- Query Learning of Both Thing and Stuff for Panoptic Segmentation-ICIP-2022 ☆15 Updated 2 years ago
- Parametric Instance Classification for Unsupervised Visual Feature Learning, NeurIPS 2020 ☆52 Updated 4 years ago
- ☆34 Updated 2 years ago
- We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances… ☆47 Updated 3 years ago
- The official code for the NeurIPS 2019 paper: Quanfu Fan, Richard Chen, Hilde Kuehne, Marco Pistoia, David Cox, "More Is Less: Learning Effi… ☆53 Updated 4 years ago
- Channel Equilibrium Networks for Learning Deep Representation, ICML2020 ☆22 Updated 4 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images ☆58 Updated 3 years ago