RobertLuo1 / HDC

The official implementation of Hierarchical Semantic Decoding with Counting Assistance for Generalized Referring Expression Segmentation

☆16

Related projects

Alternatives and complementary repositories for HDC

RobertLuo1 / NeurIPS2023_SOC
[NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
☆28 Updated 8 months ago
nini0919 / DiffPNG
[ECCV 2024] The official implementation of the DiffPNG paper in PyTorch.
☆11 Updated last month
FeipengMa6 / VLoRA
[NeurIPS 2024] Visual Perception by Large Language Model’s Weights
☆29 Updated last month
buxiangzhiren / VD-IT
☆33 Updated last month
showlab / VideoLISA
[NeurIPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
☆33 Updated 2 weeks ago
Rubics-Xuan / MRES
This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…
☆64 Updated 5 months ago
yongliu20 / SCAN
[CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"
☆61 Updated last month
ruohaoguo / ovavss
Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].
☆14 Updated 2 weeks ago
cilinyan / VISA
[ECCV 2024] VISA: Reasoning Video Object Segmentation via Large Language Model
☆133 Updated 3 months ago
clownrat6 / OpenVIS
Open-vocabulary Video Instance Segmentation codebase built upon Detectron2 and designed to be easy to use.
☆17 Updated 8 months ago
Yxxxb / LAVT-RS
[TPAMI 2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation
☆17 Updated 2 months ago
SooLab / CGFormer
The official PyTorch implementation of the CVPR 2023 paper "Contrastive Grouping with Transformer for Referring Image Segmentation".
☆43 Updated 7 months ago
LeapLabTHU / GSVA
[CVPR 2024] GSVA: Generalized Segmentation via Multimodal Large Language Models
☆93 Updated 2 months ago
rkzheng99 / ViLLa
Video Reasoning Segmentation
☆16 Updated 4 months ago
fanghaook / Awesome-Video-Instance-Segmentation
Awesome video instance segmentation papers
☆30 Updated this week
yongliu20 / UniLSeg
[CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"
☆78 Updated 8 months ago
HengLan / CGSTVG
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
☆42 Updated 4 months ago
liuting20 / DARA
[ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
☆14 Updated 3 weeks ago
mc-lan / Text4Seg
Text4Seg: Reimagining Image Segmentation as Text Generation
☆23 Updated last month
z-x-yang / DoraemonGPT
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
☆75 Updated 2 months ago
double125 / MADTP
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer
☆33 Updated 2 months ago
jianzongwu / robust-ref-seg
[TIP 2024] Towards Robust Referring Image Segmentation
☆21 Updated 8 months ago
kkakkkka / ETRIS
[ICCV 2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
☆96 Updated 9 months ago
Tavarich / Awesome-Referring-Video-Object-Segmentation
A list of referring video object segmentation papers
☆17 Updated this week
johncaged / OPT_Questioner
Official PyTorch implementation of the paper "Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner"
☆15 Updated last year
HVision-NKU / Cascade-CLIP
Official implementation of the ICML 2024 paper "Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation"
☆39 Updated 3 months ago
lyk412 / Consistent123
[ACM MM 2024] Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors
☆20 Updated last month
EasonXiao-888 / UVCOM
[CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
☆75 Updated 4 months ago
kodenii / Ref-Diff
☆16 Updated last year