amazon-science / gluonmmLinks
A library of transformer models for computer vision and multi-modality research
☆49Updated 4 years ago
Alternatives and similar repositories for gluonmm
Users that are interested in gluonmm are comparing it to the libraries listed below
Sorting:
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet …☆63Updated 5 years ago
- A ShuffleBatchNorm layer to shuffle BatchNorm statistics across multiple GPUs☆56Updated 3 years ago
- Video Noise Contrastive Estimation☆66Updated 2 years ago
- Implementation of momentum^2 teacher☆121Updated 4 years ago
- Parametric Instance Classification for Unsupervised Visual Feature Learning, NeurIPS 2020☆52Updated 4 years ago
- Pytorch code for Towards Backward-Compatible Representation Learning [CVPR 2020 Oral]☆55Updated 4 years ago
- [arXiv 2020] Video Representation Learning with Visual Tempo Consistency☆24Updated 5 years ago
- Seach Losses of our paper 'Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search', accepted by ICLR 2021.☆57Updated 3 years ago
- [ICLR 2020] Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma, "I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifie…☆20Updated 4 years ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim)☆89Updated 4 years ago
- ☆34Updated 3 years ago
- SoT: Delving Deeper into Classification Head for Transformer☆50Updated 4 years ago
- PIC Challenge Baseline☆18Updated 7 years ago
- 1st Place Solution to ECCV-TAO-2020: Detect and Represent Any Object for Tracking☆82Updated 4 years ago
- Latex style file to facilitate writing of technical papers☆37Updated 9 years ago
- A PyTorch re-implementation of the paper 'Exploring Simple Siamese Representation Learning'. Reproduced the 67.8% Top1 Acc on ImageNet.☆77Updated 4 years ago
- Vision Longformer For Object Detection☆34Updated 4 years ago
- Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective, in ICCV 2021 (Oral)☆148Updated 4 years ago
- support Large Vocabulary Instance Segmentation (LVIS) dataset for mmdetection☆16Updated 5 years ago
- ☆31Updated 4 years ago
- This is the official PyTorch implementation for "Mesa: A Memory-saving Training Framework for Transformers".☆121Updated 4 years ago
- Official implementation of the paper Visual Parser: Representing Part-whole Hierarchies with Transformers☆121Updated 4 years ago
- [ICCV 2019 Oral] TA3N: https://github.com/cmhungsteve/TA3N (Most updated repo)☆45Updated last year
- Code accompanying the ICLR 2020 submission: Learning a Spatio-Temporal Embedding for Video Instance Segmentation.☆19Updated 6 years ago
- Query Learning of Both Thing and Stuff for Panoptic Segmentation-ICIP-2022☆15Updated 3 years ago
- ☆74Updated 3 years ago
- We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances…☆47Updated 4 years ago
- Official repository of CVPR 2020 paper "On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location"☆70Updated 4 years ago
- Collections of self-supervised methods, based on cvpods.☆58Updated 4 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021☆162Updated 3 years ago