RobertLuo1/NeurIPS2023_SOC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RobertLuo1/NeurIPS2023_SOC)

RobertLuo1 / NeurIPS2023_SOC

[NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation

☆33

Alternatives and similar repositories for NeurIPS2023_SOC

Users that are interested in NeurIPS2023_SOC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cilinyan / ReVOS-api
View on GitHub
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
☆22Jul 20, 2024Updated 2 years ago
RobertLuo1 / CoHD
View on GitHub
The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
☆27Aug 17, 2025Updated 11 months ago
lxa9867 / R2VOS
View on GitHub
Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]
☆30Mar 13, 2024Updated 2 years ago
clownrat6 / OpenVIS
View on GitHub
[AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.
☆26Dec 30, 2024Updated last year
EasonXiao-888 / UVCOM
View on GitHub
[CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
☆117Jul 17, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
heshuting555 / DsHmp
View on GitHub
[CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
☆83Jul 24, 2024Updated 2 years ago
LinfengYuan1997 / LoSh
View on GitHub
[CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
☆13Jun 17, 2024Updated 2 years ago
buxiangzhiren / VD-IT
View on GitHub
Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024
☆48Sep 28, 2024Updated last year
RobertLuo1 / iccv2023_RVOS_Challenge
View on GitHub
[ICCV 2023 Workshop] The Official Implementation of The First Prize Solution for RVOS Competition
☆14Jan 1, 2024Updated 2 years ago
GeWu-Lab / Ref-AVS
View on GitHub
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
☆50Oct 12, 2025Updated 9 months ago
asudahkzj / Wnet
View on GitHub
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks
☆24Sep 6, 2022Updated 3 years ago
lsa1997 / CARIS
View on GitHub
Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]
☆30Nov 28, 2024Updated last year
jianzongwu / robust-ref-seg
View on GitHub
(TIP 2024) Towards Robust Referring Image Segmentation
☆40Mar 2, 2024Updated 2 years ago
yoxu515 / VIPOSeg-Benchmark
View on GitHub
The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".
☆12Oct 17, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dzh19990407 / LBDT
View on GitHub
CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
☆24Aug 12, 2022Updated 3 years ago
janghyuncho / DECOLA
View on GitHub
Code release for "Language-conditioned Detection Transformer"
☆86Jun 17, 2024Updated 2 years ago
HengLan / CGSTVG
View on GitHub
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
☆66Jun 28, 2024Updated 2 years ago
mengcaopku / DCNet
View on GitHub
[ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension
☆15Sep 4, 2022Updated 3 years ago
KimHanjung / VISAGE
View on GitHub
[ECCV 2024] VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement
☆38Jul 29, 2024Updated 2 years ago
yk-pku / Two-shot-Video-Object-Segmentation
View on GitHub
This work is accepted by CVPR2023
☆36Sep 20, 2023Updated 2 years ago
sukjunhwang / VITA
View on GitHub
VITA: Video Instance Segmentation via Object Token Association (NeurIPS 2022)
☆107Jan 4, 2024Updated 2 years ago
HYUNJS / STOV-TAL
View on GitHub
[WACV-2025] Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization
☆17May 28, 2025Updated last year
aquastripe / DenseCLIP
View on GitHub
An unofficial implementation for paper "DenseCLIP: Extract Free Dense Labels from CLIP"
☆24Jan 27, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Ali2500 / TarViS
View on GitHub
☆50Jul 5, 2023Updated 3 years ago
florinshen / ULAST
View on GitHub
This is the official repo of "Unsupervised Learning of Accurate Siamese Tracking"
☆19Mar 25, 2022Updated 4 years ago
ut-vision / ActionVOS
View on GitHub
[ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation
☆32Dec 4, 2024Updated last year
bo-miao / HTR
View on GitHub
[TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory
☆19Apr 9, 2025Updated last year
noelshin / zutis
View on GitHub
[CVPRW'23 Best Paper Award] Zero-shot Unsupervised Transfer Instance Segmentation
☆24Aug 22, 2023Updated 2 years ago
SitongGong / Veason-R1
View on GitHub
Official code of Veason-R1
☆15Jul 14, 2026Updated 2 weeks ago
cilinyan / VISA
View on GitHub
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
☆214Aug 5, 2024Updated last year
GeWu-Lab / Stepping-Stones
View on GitHub
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
☆18Oct 11, 2024Updated last year
zwq456 / CLIP-VIS
View on GitHub
[IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.
☆48Jan 7, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
sallymmx / TransVOS
View on GitHub
This is an official implementation of TransVOS
☆31Jul 2, 2026Updated 3 weeks ago
HYUNJS / DecAF
View on GitHub
[ICLR 2026] Official implementation of "Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation"
☆36Jan 26, 2026Updated 6 months ago
Sid2697 / HOI-Ref
View on GitHub
Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"
☆30Apr 16, 2024Updated 2 years ago
hbdat / cvpr22_cross_modal_pseudo_labeling
View on GitHub
Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22
☆44Oct 10, 2022Updated 3 years ago
Hon-Wong / Elysium
View on GitHub
[ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM
☆89Oct 25, 2024Updated last year
dahyun-kang / lavg
View on GitHub
[ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
☆51Sep 24, 2024Updated last year
helblazer811 / RefSAM
View on GitHub
Referring Image Segmentation Benchmarking with Segment Anything Model (SAM)
☆39Apr 7, 2023Updated 3 years ago