lichengunc / mask-faster-rcnn
Mask R-CNN
☆59Updated 7 years ago
Alternatives and similar repositories for mask-faster-rcnn:
Users who are interested in mask-faster-rcnn are comparing it to the libraries listed below
- ☆25Updated 7 years ago
- Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"☆30Updated 6 years ago
- Code for training temporal fully-connected CRF models in Torch☆68Updated 6 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Updated 7 years ago
- Official code for the paper "Context-aware Zero-shot Recognition" (https://arxiv.org/abs/1904.09320, to appear at AAAI 2020)☆58Updated 5 years ago
- Code for "Discriminability objective for training descriptive captions" (CVPR 2018)☆109Updated 5 years ago
- Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries, ECCV 2018☆76Updated 3 years ago
- Reimplementation of "Iterative Visual Reasoning Beyond Convolutions" (CVPR 2018); I've reimplemented it in PyTorch according to [endernewton/…☆71Updated 6 years ago
- Charades Object Detection Dataset (ICCV 2017)☆31Updated 6 years ago
- SST: Single-Stream Temporal Action Proposal☆67Updated 7 years ago
- Learning to Evaluate Image Captioning. CVPR 2018☆84Updated 6 years ago
- Rethinking the Form of Latent States in Image Captioning☆21Updated 6 years ago
- Implementation for our paper "Phrase Localization and Visual Relationship Detection with Comprehensive Image-Language Cues."☆40Updated 7 years ago
- This is our PyTorch implementation of Multi-level Scene Description Network (MSDN) proposed in our ICCV 2017 paper.☆227Updated 5 years ago
- PyTorch implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori, ECCV 2018☆171Updated 6 years ago
- VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation☆22Updated 7 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆33Updated 5 years ago
- Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding☆23Updated 6 years ago
- ☆77Updated 6 years ago
- Code for "Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation" (CVPR 2018)☆84Updated 6 years ago
- Implementation for the AAAI2019 paper "Large-scale Visual Relationship Understanding"☆145Updated 5 years ago
- Code release for Hu et al., "Modeling Relationships in Referential Expressions with Compositional Modular Networks", CVPR 2017☆67Updated 6 years ago
- [COLING 2018] Learning Visually-Grounded Semantics from Contrastive Adversarial Samples.☆57Updated 5 years ago
- A PyTorch implementation of the "Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection" paper …☆63Updated 6 years ago
- ☆25Updated 7 years ago
- Lua☆57Updated 6 years ago
- Visual Relationship Detection☆112Updated 3 years ago
- Soft Proposal Networks for Weakly Supervised Object Localization, in ICCV 2017☆94Updated 7 years ago
- Implementation for our paper "Conditional Image-Text Embedding Networks"☆39Updated 4 years ago
- Implementation of our ICCV 2019 paper "Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection"☆29Updated 2 years ago