Cross-Modal Self-Attention Network for Referring Image Segmentation cvpr19
☆56Sep 11, 2019Updated 6 years ago
Alternatives and similar repositories for CMSA-Net
Users that are interested in CMSA-Net are comparing it to the libraries listed below
Sorting:
- Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.☆63Feb 2, 2021Updated 5 years ago
- Code for Linguistic Structure Guided Context Modeling for Referring Image Segmentation, ECCV2020.☆15Oct 2, 2020Updated 5 years ago
- ☆30Jul 26, 2019Updated 6 years ago
- 'Bi-directional Relationship Inferring Network for Referring Image Segmentation' CVPR2020☆18Apr 2, 2022Updated 3 years ago
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆28Nov 28, 2024Updated last year
- Referring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019☆31Apr 21, 2021Updated 4 years ago
- Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries, ECCV 2018☆76Sep 21, 2021Updated 4 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- ☆38Jul 23, 2017Updated 8 years ago
- Inferring and Executing Programs for Visual Reasoning☆21Jan 4, 2019Updated 7 years ago
- ☆45Oct 3, 2023Updated 2 years ago
- ☆14Jul 13, 2021Updated 4 years ago
- ☆12Oct 21, 2019Updated 6 years ago
- SalNet on Keras: A deep convolutional network for saliency prediction☆11Jun 23, 2017Updated 8 years ago
- RefVOS☆29Feb 3, 2021Updated 5 years ago
- An MRI-pathology model (MRI-based Predicted Transformer for Prostate cancer (MRI-PTPCa)) was proposed to discover correlations between mp…☆35Feb 5, 2025Updated last year
- Recursive Neural Networks implemented with Tensorflow☆13Nov 5, 2019Updated 6 years ago
- End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021☆18Oct 24, 2021Updated 4 years ago
- MAttNet: Modular Attention Network for Referring Expression Comprehension☆298Nov 29, 2022Updated 3 years ago
- Dynamic Robot Instruction Following☆39Dec 28, 2021Updated 4 years ago
- Referring Expression Datasets API☆561Aug 27, 2024Updated last year
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆113May 13, 2020Updated 5 years ago
- ☆15Mar 20, 2020Updated 5 years ago
- The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283☆166Mar 1, 2017Updated 9 years ago
- Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation, CVPR2021☆19Aug 17, 2021Updated 4 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries☆43Jul 20, 2020Updated 5 years ago
- Joint Modelling Histology and Molecular Markers for Glioma Classification☆12Jun 4, 2025Updated 9 months ago
- PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation☆122Oct 3, 2023Updated 2 years ago
- Deep Active Contour Network for Medical Image Segmentation☆20Nov 16, 2020Updated 5 years ago
- Phrase Localization Evaluation Toolkit☆20Aug 16, 2019Updated 6 years ago
- ☆221Apr 13, 2023Updated 2 years ago
- Implementation for "Joint Event Detection and Description in Continuous Video Streams"☆23Nov 4, 2020Updated 5 years ago
- The official PyTorch implementation for paper "Hierarchical Domain-Adapted Feature Learning for Video Saliency Prediction"☆27Mar 13, 2023Updated 2 years ago
- ☆32Mar 31, 2023Updated 2 years ago
- A PyTorch implementation of TVC☆24Dec 18, 2023Updated 2 years ago
- [CVPR2019] Dual Encoding for Zero-Example Video Retrieval☆153Jan 10, 2023Updated 3 years ago
- Code for the VOST dataset☆26Oct 1, 2023Updated 2 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Jun 28, 2021Updated 4 years ago
- [IEEE TPAMI22] MobileSal: Extremely Efficient RGB-D Salient Object Detection [PyTorch & Jittor]☆66Sep 22, 2025Updated 5 months ago