buxiangzhiren/VD-IT

Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024

☆48

Alternatives and similar repositories for VD-IT

Users who are interested in VD-IT are comparing it to the repositories listed below.

heshuting555 / DsHmp
[CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
☆83 · Jul 24, 2024 · Updated 2 years ago
cilinyan / ReVOS-api
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
☆22 · Jul 20, 2024 · Updated 2 years ago
RobertLuo1 / NeurIPS2023_SOC
[NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
☆33 · Mar 16, 2024 · Updated 2 years ago
bo-miao / SgMg
[ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.
☆128 · Apr 9, 2025 · Updated last year
OpenGVLab / MUTR
「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation
☆85 · Jun 13, 2025 · Updated last year
rongfu-dsb / MPG-SAM2
[ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
☆23 · Sep 5, 2025 · Updated 10 months ago
haochenheheda / LVVIS
Large-Vocabulary Video Instance Segmentation dataset
☆100 · Jul 5, 2024 · Updated 2 years ago
LinfengYuan1997 / LoSh
[CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
☆13 · Jun 17, 2024 · Updated 2 years ago
Tavarich / Awesome-Referring-Video-Object-Segmentation
A list of referring video object segmentation papers
☆63 · Jun 28, 2026 · Updated last month
wudongming97 / OnlineRefer
[ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
☆58 · Oct 7, 2023 · Updated 2 years ago
gaomingqi / Awesome-Video-Object-Segmentation
🔥 Latest advances in Video Object Segmentation (VOS) – papers, datasets, and projects.
☆516 · Jul 13, 2026 · Updated 2 weeks ago
cilinyan / VISA
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
☆214 · Aug 5, 2024 · Updated last year
appletea233 / AL-Ref-SAM2
[AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video…
☆93 · Dec 23, 2024 · Updated last year
Hydragon516 / DPA
[CVPR 2024] Dual Prototype Attention for Unsupervised Video Object Segmentation
☆38 · Apr 21, 2024 · Updated 2 years ago
jabir-zheng / MMoT-Transformer
A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".
☆12 · Jan 16, 2023 · Updated 3 years ago
bo-miao / HTR
[TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory
☆19 · Apr 9, 2025 · Updated last year
ut-vision / ActionVOS
[ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation
☆32 · Dec 4, 2024 · Updated last year
KainingYing / CTVIS
[ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation
☆83 · Oct 15, 2023 · Updated 2 years ago
GeWu-Lab / Stepping-Stones
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
☆18 · Oct 11, 2024 · Updated last year
Tapall-AI / MeViS_Track_Solution_2024
[CVPR 2024 Challenge] 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation
☆31 · Oct 18, 2024 · Updated last year
GeWu-Lab / Ref-AVS
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
☆50 · Oct 12, 2025 · Updated 9 months ago
GLUS-video / GLUS
[CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…
☆70 · Jun 23, 2025 · Updated last year
QianWangX / VidSeg_diffusion
Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]
☆60 · Feb 27, 2025 · Updated last year
ClaudiaCuttano / SAMWISE
[CVPR 2025 Highlight] "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
☆386 · Sep 25, 2025 · Updated 10 months ago
yannqi / COMBO-AVS
[CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…
☆40 · Apr 20, 2025 · Updated last year
dzh19990407 / LBDT
CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
☆24 · Aug 12, 2022 · Updated 3 years ago
suhwan-cho / FindTrack
[ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
☆82 · Oct 22, 2025 · Updated 9 months ago
aspirinone / CATR.github.io
☆31 · Mar 1, 2024 · Updated 2 years ago
ziplab / MPVSS
☆33 · Feb 29, 2024 · Updated 2 years ago
CUHK-AIM-Group / CLIFF
[ECCV' 24 Oral] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection
☆32 · Sep 26, 2024 · Updated last year
jiaosiyu1999 / MAFT-Plus
☆60 · Sep 14, 2024 · Updated last year
asudahkzj / Wnet
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks
☆24 · Sep 6, 2022 · Updated 3 years ago
UCSB-AI / via-video
☆25 · May 12, 2026 · Updated 2 months ago
lxa9867 / R2VOS
Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]
☆30 · Mar 13, 2024 · Updated 2 years ago
srijandas07 / clip_baseline_LTA_Ego4d
Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)
☆15 · Jul 4, 2022 · Updated 4 years ago
wwangcece / CRefDiff
[JSTARS 2026] Controllable Reference-Guided Diffusion with Local–Global Fusion for Real-World Remote Sensing Image Super-Resolution
☆17 · Apr 28, 2026 · Updated 3 months ago
qjy981010 / cocoapi
COCO API Customized for OVIS evaluation
☆17 · Nov 8, 2021 · Updated 4 years ago
fanghaook / LBVQ
Learning Better Video Query with SAM for Video Instance Segmentation (TCSVT 2024)
☆26 · Apr 2, 2024 · Updated 2 years ago
ruohaoguo / ovavss
Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].
☆37 · Nov 2, 2024 · Updated last year