clownrat6/OpenVIS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/clownrat6/OpenVIS)

clownrat6 / OpenVIS

[AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.

☆26

Alternatives and similar repositories for OpenVIS

Users that are interested in OpenVIS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KimHanjung / VISAGE
View on GitHub
[ECCV 2024] VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement
☆38Jul 29, 2024Updated last year
rain305f / OSP
View on GitHub
[CVPR 2023] Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning
☆22Jun 11, 2023Updated 3 years ago
RobertLuo1 / NeurIPS2023_SOC
View on GitHub
[NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
☆33Mar 16, 2024Updated 2 years ago
DAMO-NLP-SG / CMM
View on GitHub
✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
☆54Jul 11, 2025Updated last year
qpc1611094 / FPL
View on GitHub
Fuzzy Positive Learning (CVPR2023)
☆15Jul 25, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
clownrat6 / VectorNet
View on GitHub
The implementation of VectorNet. Done and Lose
☆41Jun 21, 2020Updated 6 years ago
TIGER-AI-Lab / VideoEval-Pro
View on GitHub
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]
☆15Jun 1, 2026Updated last month
LinfengYuan1997 / LoSh
View on GitHub
[CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
☆13Jun 17, 2024Updated 2 years ago
haochenheheda / LVVIS
View on GitHub
Large-Vocabulary Video Instance Segmentation dataset
☆99Jul 5, 2024Updated 2 years ago
sukjunhwang / VITA
View on GitHub
VITA: Video Instance Segmentation via Object Token Association (NeurIPS 2022)
☆107Jan 4, 2024Updated 2 years ago
fanghaook / LBVQ
View on GitHub
Learning Better Video Query with SAM for Video Instance Segmentation (TCSVT 2024)
☆26Apr 2, 2024Updated 2 years ago
jpthu17 / HBI
View on GitHub
[CVPR 2023 Highlight & TPAMI] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
☆125Dec 28, 2024Updated last year
HYUNJS / STOV-TAL
View on GitHub
[WACV-2025] Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization
☆17May 28, 2025Updated last year
miranheo / GenVIS
View on GitHub
[CVPR'23] A Generalized Framework for Video Instance Segmentation
☆136Jan 4, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
KainingYing / CTVIS
View on GitHub
[ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation
☆83Oct 15, 2023Updated 2 years ago
fanghaook / Awesome-Video-Instance-Segmentation
View on GitHub
Awesome video instance segmentation papers
☆58Mar 12, 2026Updated 4 months ago
fanghaook / OVFormer
View on GitHub
[ECCV 2024] Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation
☆36Jan 6, 2025Updated last year
clownrat6 / Novel_Theft
View on GitHub
轻小说文库 epub 解析打包
☆21May 3, 2020Updated 6 years ago
VIStA-H / GPT-4V_Social_Media
View on GitHub
GPT-4V(ision) as A Social Media Analysis Engine
☆39Dec 20, 2024Updated last year
DAMO-NLP-SG / Inf-CLIP
View on GitHub
[CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for C…
☆287Jan 16, 2025Updated last year
OurBluePrint / easy_video
View on GitHub
☆20Mar 3, 2025Updated last year
jianzongwu / betrayed-by-captions
View on GitHub
(ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
☆48Jul 18, 2024Updated 2 years ago
SaFo-Lab / AdaShield
View on GitHub
[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…
☆73Feb 9, 2026Updated 5 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rain305f / TIDA
View on GitHub
[NeurIPS 2023] Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning
☆17Apr 15, 2024Updated 2 years ago
lkhl / tiny-transformers
View on GitHub
[ECCV 2022] Implementation of the paper "Locality Guidance for Improving Vision Transformers on Tiny Datasets"
☆84Jul 21, 2022Updated 4 years ago
983632847 / SAM-for-Videos
View on GitHub
This repository is for the first survey on SAM & SAM2 for Videos.
☆53Apr 29, 2025Updated last year
munanning / MADAv2
View on GitHub
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation
☆25Jul 8, 2023Updated 3 years ago
HYUNJS / STTM
View on GitHub
[ICCV 2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs
☆61Feb 2, 2026Updated 5 months ago
PKU-YuanGroup / LLMBind
View on GitHub
LLMBind: A Unified Modality-Task Integration Framework
☆19Jun 16, 2024Updated 2 years ago
jbistanbul / MiniROAD
View on GitHub
☆42May 7, 2024Updated 2 years ago
Leon1207 / 3DRefTR
View on GitHub
This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"
☆26Aug 24, 2023Updated 2 years ago
wudongming97 / OnlineRefer
View on GitHub
[ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
☆58Oct 7, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
EIT-NLP / Speak-While-Watching
View on GitHub
☆17Mar 1, 2026Updated 4 months ago
bytedance / fc-clip
View on GitHub
[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…
☆345Feb 5, 2024Updated 2 years ago
alanaai / EVUD
View on GitHub
Egocentric Video Understanding Dataset (EVUD)
☆34Jul 4, 2024Updated 2 years ago
yanmin-wu / EDA
View on GitHub
[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
☆134Oct 11, 2023Updated 2 years ago
heshuting555 / RefMask3D
View on GitHub
[ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation
☆65Jul 29, 2024Updated last year
PinxueGuo / X-Prompt
View on GitHub
☆17Oct 4, 2024Updated last year
OpenGVLab / TPO
View on GitHub
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment
☆65Jul 22, 2025Updated last year