amazon-science / gluonmm
A library of transformer models for computer vision and multi-modality research
☆49 Updated 3 years ago
Alternatives and similar repositories for gluonmm
Users who are interested in gluonmm are comparing it to the libraries listed below.
- Learning Representational Invariances for Data-Efficient Action Recognition ☆33 Updated 3 years ago
- Video Noise Contrastive Estimation ☆66 Updated last year
- [arXiv 2020] Video Representation Learning with Visual Tempo Consistency ☆24 Updated 4 years ago
- ☆31 Updated 3 years ago
- PIC Challenge Baseline ☆19 Updated 6 years ago
- 1st Place Solution to ECCV-TAO-2020: Detect and Represent Any Object for Tracking ☆80 Updated 4 years ago
- Code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction ☆99 Updated 4 years ago
- ☆44 Updated 4 years ago
- ☆34 Updated 3 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020. ☆48 Updated 4 years ago
- Implementation of momentum^2 teacher ☆121 Updated 4 years ago
- ☆73 Updated 2 years ago
- Support for the Large Vocabulary Instance Segmentation (LVIS) dataset in mmdetection ☆16 Updated 5 years ago
- RareAct: A video dataset of unusual interactions ☆32 Updated 4 years ago
- LaTeX style file to facilitate writing of technical papers ☆37 Updated 9 years ago
- Parametric Instance Classification for Unsupervised Visual Feature Learning, NeurIPS 2020 ☆52 Updated 4 years ago
- A ShuffleBatchNorm layer to shuffle BatchNorm statistics across multiple GPUs ☆56 Updated 3 years ago
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet … ☆60 Updated 4 years ago
- PyTorch code for Towards Backward-Compatible Representation Learning [CVPR 2020 Oral] ☆54 Updated 3 years ago
- [AAAI 2020] Temporal Interlacing Network ☆84 Updated 4 years ago
- We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances… ☆47 Updated 3 years ago
- ☆41 Updated 3 years ago
- MIST: Multiple Instance Spatial Transformer ☆25 Updated 3 years ago
- [ECCV 2020] Boundary-Aware Cascade Networks for Temporal Action Segmentation ☆84 Updated 4 years ago
- ☆16 Updated 4 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021) ☆33 Updated 3 years ago
- ☆35 Updated 3 years ago
- ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018 ☆78 Updated 6 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021 ☆158 Updated 2 years ago
- CUDA implementation of depthwise conv3d ☆22 Updated 3 years ago