Cross-Modal Self-Attention Network for Referring Image Segmentation cvpr19
☆56Sep 11, 2019Updated 6 years ago
Alternatives and similar repositories for CMSA-Net
Users that are interested in CMSA-Net are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.☆63Feb 2, 2021Updated 5 years ago
- Code for Linguistic Structure Guided Context Modeling for Referring Image Segmentation, ECCV2020.☆15Oct 2, 2020Updated 5 years ago
- ☆30Jul 26, 2019Updated 6 years ago
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆28Nov 28, 2024Updated last year
- Referring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019☆31Apr 21, 2021Updated 4 years ago
- 'Bi-directional Relationship Inferring Network for Referring Image Segmentation' CVPR2020☆18Apr 2, 2022Updated 3 years ago
- ☆45Oct 3, 2023Updated 2 years ago
- ☆38Jul 23, 2017Updated 8 years ago
- Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries, ECCV 2018☆76Sep 21, 2021Updated 4 years ago
- Inferring and Executing Programs for Visual Reasoning☆21Jan 4, 2019Updated 7 years ago
- Code for TGRS 2021 paper. Edge-Aware Multiscale Feature Integration Network for Salient Object Detection in Optical Remote Sensing Images…☆13Apr 6, 2022Updated 3 years ago
- SalNet on Keras: A deep convolutional network for saliency prediction☆11Jun 23, 2017Updated 8 years ago
- ☆14Jul 13, 2021Updated 4 years ago
- RefVOS☆28Feb 3, 2021Updated 5 years ago
- [ICME'22] Visual Grounding with Transformers☆28May 27, 2022Updated 3 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆26Jan 20, 2022Updated 4 years ago
- Referring Expression Datasets API☆562Aug 27, 2024Updated last year
- [CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)☆139Aug 4, 2022Updated 3 years ago
- Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation, CVPR2021☆19Aug 17, 2021Updated 4 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283☆166Mar 1, 2017Updated 9 years ago
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆113May 13, 2020Updated 5 years ago
- An unofficial pytorch implementation of "TransVG: End-to-End Visual Grounding with Transformers".☆52Jun 7, 2021Updated 4 years ago
- Evaluation Framework for DAVIS 2017 Semi-supervised and Unsupervised used in the DAVIS Challenges☆193Feb 26, 2023Updated 3 years ago
- [ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation☆358Jan 7, 2022Updated 4 years ago
- Refer-Youtube-VOS dataset☆27Mar 10, 2026Updated 2 weeks ago
- End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021☆18Oct 24, 2021Updated 4 years ago
- Deep Active Contour Network for Medical Image Segmentation☆20Nov 16, 2020Updated 5 years ago
- ☆10May 10, 2018Updated 7 years ago
- Simple vs complex temporal recurrences for video saliency prediction (BMVC 2019)☆26Nov 22, 2022Updated 3 years ago
- ☆12Oct 21, 2019Updated 6 years ago
- A collection of papers about Referring Image Segmentation.☆812Jan 28, 2026Updated last month
- The official PyTorch implementation for paper "Hierarchical Domain-Adapted Feature Learning for Video Saliency Prediction"☆27Mar 13, 2023Updated 3 years ago
- [CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.☆50Aug 31, 2021Updated 4 years ago
- An MRI-pathology model (MRI-based Predicted Transformer for Prostate cancer (MRI-PTPCa)) was proposed to discover correlations between mp…☆36Feb 5, 2025Updated last year
- Graph-Structured Referring Expressions Reasoning in The Wild, In CVPR 2020, Oral.☆116Aug 10, 2020Updated 5 years ago
- This is the repo for Multi-level textual grounding☆34Jul 21, 2020Updated 5 years ago
- Training code for "SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation"☆87Nov 21, 2021Updated 4 years ago
- EvaluationToolBox for Camouflaged Object Detection Task☆58Dec 4, 2020Updated 5 years ago