amazon-science / gluonmm
A library of transformer models for computer vision and multi-modality research
☆49Updated 3 years ago
Alternatives and similar repositories for gluonmm:
Users that are interested in gluonmm are comparing it to the libraries listed below
- [arXiv 2020] Video Representation Learning with Visual Tempo Consistency☆24Updated 4 years ago
- Video Noise Contrastive Estimation☆66Updated last year
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet …☆60Updated 4 years ago
- 1st Place Solution to ECCV-TAO-2020: Detect and Represent Any Object for Tracking☆80Updated 4 years ago
- Implementation of momentum^2 teacher☆121Updated 4 years ago
- ☆34Updated 2 years ago
- PIC Challenge Baseline☆19Updated 6 years ago
- ☆31Updated 3 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆99Updated 3 years ago
- ☆50Updated last year
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 4 years ago
- A ShuffleBatchNorm layer to shuffle BatchNorm statistics across multiple GPUs☆56Updated 3 years ago
- Pytorch code for Towards Backward-Compatible Representation Learning [CVPR 2020 Oral]☆54Updated 3 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- ☆44Updated 3 years ago
- support Large Vocabulary Instance Segmentation (LVIS) dataset for mmdetection☆16Updated 5 years ago
- [ICCV 2019 Oral] TA3N: https://github.com/cmhungsteve/TA3N (Most updated repo)☆45Updated 5 months ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 3 years ago
- MIST: Multiple Instance Spatial Transformer☆25Updated 3 years ago
- ☆47Updated 5 years ago
- Seach Losses of our paper 'Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search', accepted by ICLR 2021.☆56Updated 3 years ago
- We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances…☆47Updated 3 years ago
- [ECCV 2020] Boundary-Aware Cascade Networks for Temporal Action Segmentation☆84Updated 4 years ago
- The official Codes for NeurIPS 2019 paper. Quanfu Fan, Ricarhd Chen, Hilde Kuehne, Marco Pistoia, David Cox, "More Is Less: Learning Effi…☆53Updated 4 years ago
- Vision Longformer For Object Detection☆35Updated 3 years ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim)☆89Updated 3 years ago
- ☆16Updated 4 years ago
- Implementation of NeurIPS2020 paper: Auto-Panoptic: Multi-Component Architecture Search for Panoptic Segmentation☆20Updated 4 years ago
- Query Learning of Both Thing and Stuff for Panoptic Segmentation-ICIP-2022☆15Updated 2 years ago
- ☆41Updated 2 years ago