lwye/CMSA-Net

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lwye/CMSA-Net)

lwye / CMSA-Net

Cross-Modal Self-Attention Network for Referring Image Segmentation cvpr19

☆57

Alternatives and similar repositories for CMSA-Net

Users that are interested in CMSA-Net are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

liruiyu / referseg_rrn
View on GitHub
☆31Jul 26, 2019Updated 7 years ago
wenz116 / lang2seg
View on GitHub
Referring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019
☆31Apr 21, 2021Updated 5 years ago
chenxi116 / TF-phrasecut-public
View on GitHub
☆38Jul 23, 2017Updated 9 years ago
BCV-Uniandes / DMS
View on GitHub
Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries, ECCV 2018
☆76Sep 21, 2021Updated 4 years ago
usr922 / vgtr
View on GitHub
[ICME'22] Visual Grounding with Transformers
☆28May 27, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
lichengunc / refer-parser2
View on GitHub
Referring Expression Parser
☆27Feb 10, 2018Updated 8 years ago
ccvl / iep-ref
View on GitHub
Inferring and Executing Programs for Visual Reasoning
☆21Jan 4, 2019Updated 7 years ago
massens / salnet-keras
View on GitHub
SalNet on Keras: A deep convolutional network for saliency prediction
☆11Jun 23, 2017Updated 9 years ago
SijieSong / CVPR21-Cogrounding_semantic_attention
View on GitHub
☆14Jul 13, 2021Updated 5 years ago
miriambellver / refvos
View on GitHub
RefVOS
☆28Feb 3, 2021Updated 5 years ago
lichengunc / MAttNet
View on GitHub
MAttNet: Modular Attention Network for Referring Expression Comprehension
☆299Nov 29, 2022Updated 3 years ago
ccvl / clevr-refplus-dataset-gen
View on GitHub
A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
☆26Jan 20, 2022Updated 4 years ago
lichengunc / refer
View on GitHub
Referring Expression Datasets API
☆573Aug 27, 2024Updated last year
luogen1996 / MCN
View on GitHub
[CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)
☆139Aug 4, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
fengguang94 / CEFNet
View on GitHub
Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation, CVPR2021
☆21Aug 17, 2021Updated 4 years ago
nku-shengzheliu / Pytorch-TransVG
View on GitHub
An unofficial pytorch implementation of "TransVG: End-to-End Visual Grounding with Transformers".
☆50Jun 7, 2021Updated 5 years ago
mjhucla / Google_Refexp_toolbox
View on GitHub
The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283
☆166Mar 1, 2017Updated 9 years ago
ChenyunWu / PhraseCutDataset
View on GitHub
Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"
☆116Mar 28, 2026Updated 3 months ago
yz93 / LAVT-RIS
View on GitHub
☆234Apr 13, 2023Updated 3 years ago
jianzongwu / robust-ref-seg
View on GitHub
(TIP 2024) Towards Robust Referring Image Segmentation
☆40Mar 2, 2024Updated 2 years ago
wenz116 / DRFT
View on GitHub
End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021
☆18Oct 24, 2021Updated 4 years ago
MarkMoHR / Awesome-Referring-Image-Segmentation
View on GitHub
A collection of papers about Referring Image Segmentation.
☆826Jan 28, 2026Updated 5 months ago
yunyikristy / skipNet
View on GitHub
☆12Oct 21, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
christophschuhmann / 4MC-4M-Image-Text-Pairs-with-CLIP-embeddings
View on GitHub
I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…
☆17Apr 22, 2021Updated 5 years ago
perceivelab / hd2s
View on GitHub
The official PyTorch implementation for paper "Hierarchical Domain-Adapted Feature Learning for Video Saliency Prediction"
☆27Mar 13, 2023Updated 3 years ago
svip-lab / LBYLNet
View on GitHub
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
☆50Aug 31, 2021Updated 4 years ago
lil-lab / drif
View on GitHub
Dynamic Robot Instruction Following
☆42Dec 28, 2021Updated 4 years ago
pxg / S3-image-compression
View on GitHub
S3 automatic lossless image compression
☆10Aug 28, 2015Updated 10 years ago
dukebw / SSTVOS
View on GitHub
Training code for "SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation"
☆88Nov 21, 2021Updated 4 years ago
hassanhub / MultiGrounding
View on GitHub
This is the repo for Multi-level textual grounding
☆34Jul 21, 2020Updated 6 years ago
jiwei0921 / DCF
View on GitHub
Code for CVPR 2021 paper. "Calibrated RGB-D Salient Object detection".
☆45Oct 20, 2021Updated 4 years ago
jacobswan1 / MTG-pytorch
View on GitHub
Gender/Age attribute grounding using weak supervised manner.
☆12Jun 23, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bighuang624 / AGAM
View on GitHub
Code for the AAAI 2021 paper "Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition".
☆10Nov 21, 2022Updated 3 years ago
ms-dot-k / LRW_ID
View on GitHub
The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…
☆10Oct 12, 2023Updated 2 years ago
IVIPLab / FDIWN
View on GitHub
This repository is an official PyTorch implementation of our paper "Feature Distillation Interaction Weighting Network for Lightweight Im…
☆21May 10, 2023Updated 3 years ago
jayanthkoushik / cmu-ammml-project
View on GitHub
Project for the Advanced Multimodal Machine Learning course at CMU.
☆14May 14, 2016Updated 10 years ago
vmlaker / benchmark-sharedmem
View on GitHub
Performance test of NumPy shared memory module
☆14Mar 8, 2016Updated 10 years ago
hanhung / TGNN
View on GitHub
☆26Mar 15, 2022Updated 4 years ago
houph4 / Efficient6D-SAM-VLM-to-Grasping-task
View on GitHub
Our repo containes a Efficient RGB-D features extractor to category-level and instance-level 6D pose estimation.
☆15Oct 29, 2025Updated 8 months ago