ruotianluo / rtutils
☆17Updated 2 years ago
Alternatives and similar repositories for rtutils
Users that are interested in rtutils are comparing it to the libraries listed below
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet …☆63Updated 5 years ago
- (Batched) advanced indexing for PyTorch.☆53Updated 10 months ago
- For visual commonsense model☆34Updated 6 years ago
- ☆19Updated 6 years ago
- The project is about predicting sets (of classes) from images.☆23Updated 4 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated 2 years ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Updated 6 years ago
- Code of "Visualizing and Understanding Object Detector"☆20Updated 4 years ago
- support Large Vocabulary Instance Segmentation (LVIS) dataset for mmdetection☆16Updated 5 years ago
- A Simple Framework for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Updated 4 years ago
- ☆42Updated 5 years ago
- A ShuffleBatchNorm layer to shuffle BatchNorm statistics across multiple GPUs☆56Updated 3 years ago
- Video Noise Contrastive Estimation☆66Updated 2 years ago
- ☆35Updated 2 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Updated 8 years ago
- ☆20Updated 4 years ago
- Parametric Instance Classification for Unsupervised Visual Feature Learning, NeurIPS 2020☆52Updated 4 years ago
- A PyTorch implementation of our NeurIPS paper, PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph☆53Updated 2 years ago
- Code for reproducing experiments in "How Useful is Self-Supervised Pretraining for Visual Tasks?"☆60Updated last year
- Vision Longformer For Object Detection☆34Updated 4 years ago
- A library of transformer models for computer vision and multi-modality research☆49Updated 4 years ago
- ☆73Updated 3 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval”☆12Updated last year
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT☆72Updated 3 years ago
- Rethinking Nearest Neighbors for Visual Classification☆31Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 4 years ago
- Project page for "Visual Grounding in Video for Unsupervised Word Translation" CVPR 2020☆42Updated 5 years ago
- MLPs for Vision and Language Modeling (Coming Soon)☆27Updated 3 years ago
- When can you tell whether an image has been cropped or not?☆29Updated 4 years ago
- Official code for paper Context-aware Zero-shot Recognition (https://arxiv.org/abs/1904.09320 to appear at AAAI2020)☆58Updated 6 years ago