ymq2017/entitysam

[CVPR'2025] EntitySAM: Segment Everything in Video

☆67

Alternatives and similar repositories for entitysam

Users who are interested in entitysam are comparing it to the repositories listed below.

SkyworkAI / DAQ-VS
Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]
☆15 · Jul 11, 2024 · Updated 2 years ago
iSEE-Laboratory / ReferDINO
(ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations
☆142 · Nov 14, 2025 · Updated 8 months ago
congvvc / HyperSeg
[CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".
☆182 · Dec 13, 2024 · Updated last year
congvvc / InstructSeg
[ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"
☆56 · Feb 10, 2025 · Updated last year
kumuji / Sa2VA-i
Sa2VA-i is an improved version of the popular Sa2VA model
☆17 · Nov 25, 2025 · Updated 8 months ago
HamadYA / SAM3-TrackBench
This repository contains the implementation of SAM3 trackers.
☆36 · Jun 30, 2026 · Updated 3 weeks ago
rui-qian / UGround
Rui Qian, Xin Yin, Chuanhang Deng, et al.: UGround: Towards Unified Visual Grounding with Unrolled Transformers (ICML 2026)
☆29 · Jun 18, 2026 · Updated last month
zhaihongjia / PanoGS
[CVPR 2025] PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding
☆115 · Jun 15, 2025 · Updated last year
hanxunyu / Stream3D-VLM
[ECCV 2026🔥] Official code repository for "Stream3D-VLM: Online 3D Spatial Understanding with Incremental Geometry Priors"
☆45 · Jun 23, 2026 · Updated last month
ChocoWu / SeTok
Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM
☆81 · Apr 19, 2025 · Updated last year
ByChelsea / CMOS
[CVPR 2023] Better “CMOS” Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution
☆10 · Mar 19, 2024 · Updated 2 years ago
KanghoonYoon / torch-rasgg
This is anonymous repository for submitting our work to a conference
☆14 · Dec 17, 2024 · Updated last year
wuxiaofei01 / PFVG
☆20 · Dec 24, 2025 · Updated 7 months ago
wookiekim / SOLACE
SOLACE: Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards (CVPR 2026)
☆17 · Jun 2, 2026 · Updated last month
SysCV / cascade-detr
[ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection
☆100 · Sep 12, 2023 · Updated 2 years ago
rkzheng99 / TMT-VIS
Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation (NeurIPS 23)
☆12 · May 7, 2025 · Updated last year
siyuanliii / SLAck
Official Implementation of ECCV2024 paper: SLAck
☆29 · Sep 18, 2024 · Updated last year
ClaudiaCuttano / SAMWISE
[CVPR 2025 Highlight] "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
☆386 · Sep 25, 2025 · Updated 10 months ago
YuHengsss / Trident
[ICCV2025] Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation
☆126 · Nov 22, 2025 · Updated 8 months ago
geshang777 / Seg-R1
[NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"
☆72 · Jul 1, 2025 · Updated last year
ProvenceStar / PartGLEE
[ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
☆64 · Sep 17, 2024 · Updated last year
avaxiao / TextRegion
[TMLR 2025 J2C] TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models
☆54 · Dec 24, 2025 · Updated 7 months ago
Robiwan245 / SiamMAE
☆12 · Mar 5, 2024 · Updated 2 years ago
hshjerry / VideoEspresso
[CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
☆140 · Jul 28, 2025 · Updated 11 months ago
liuzhuang13 / anytime
Anytime Dense Prediction with Confidence Adaptivity (ICLR 2022)
☆51 · Aug 23, 2024 · Updated last year
letitiabanana / PnP-OVSS
[CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
☆18 · Jul 22, 2024 · Updated 2 years ago
xushilin1 / dst-det
[TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det
☆35 · Jun 3, 2025 · Updated last year
AI-Application-and-Integration-Lab / SAM4MLLM
[ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation
☆51 · Mar 20, 2025 · Updated last year
haochenheheda / LVVIS
Large-Vocabulary Video Instance Segmentation dataset
☆99 · Jul 5, 2024 · Updated 2 years ago
facebookresearch / PartDistillation
Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"
☆60 · Dec 17, 2023 · Updated 2 years ago
zhouyiks / CoLVA
☆44 · Jul 9, 2025 · Updated last year
Seung-Hun-Lee / CAVIS
Official code for CAVIS: Context-Aware Video Instance Segmentation
☆116 · Sep 17, 2025 · Updated 10 months ago
MICV-yonsei / CASS
[CVPR 2025] Official Pytorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
☆50 · Mar 27, 2025 · Updated last year
renyulin-f / MoE-DiffIR
The code source of MoE-DiffIR
☆50 · Mar 21, 2025 · Updated last year
ezeli / InSentiCap_model
A pytorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral).
☆11 · Jul 18, 2022 · Updated 4 years ago
rui-qian / READ
Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)
☆54 · Feb 4, 2026 · Updated 5 months ago
zaplm / DC-SAM
Official Code for: "DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency"
☆56 · Dec 26, 2025 · Updated 7 months ago
SitongGong / Veason-R1
Official code of Veason-R1
☆15 · Jul 14, 2026 · Updated last week
showlab / SAM-I2V
[CVPR 2025] SAM-I2V
☆39 · Jan 2, 2026 · Updated 6 months ago