xushilin1 / dst-detView external linksLinks
[TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det
☆32Jun 3, 2025Updated 8 months ago
Alternatives and similar repositories for dst-det
Users that are interested in dst-det are comparing it to the libraries listed below
Sorting:
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Jul 11, 2024Updated last year
- ☆42Jul 9, 2025Updated 7 months ago
- [CVPR 2025] DreamRelation: Bridging Customization and Relation Generation☆19Dec 17, 2025Updated last month
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆35May 8, 2025Updated 9 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆201Feb 5, 2024Updated 2 years ago
- [ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS☆110Mar 18, 2024Updated last year
- [ECCV-2022] The First Unified End-to-End System for Panoptic Part Segmentation☆63Sep 2, 2024Updated last year
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆268Apr 11, 2025Updated 10 months ago
- [ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology☆73Jan 26, 2026Updated 3 weeks ago
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)☆29Jul 23, 2024Updated last year
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆48Jul 18, 2024Updated last year
- Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)☆29Jan 12, 2024Updated 2 years ago
- This repository contains data and analysis scripts to reproduce the figures as well as source code and simulation scripts to perform the …☆13Apr 13, 2021Updated 4 years ago
- Official implementation of "Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence"☆132Dec 18, 2025Updated last month
- [ECCV 2022] 🎵PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation☆56Dec 22, 2022Updated 3 years ago
- ☆17Apr 9, 2025Updated 10 months ago
- [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection☆190Mar 29, 2025Updated 10 months ago
- (TIP 2024) Towards Robust Referring Image Segmentation☆36Mar 2, 2024Updated last year
- ☆134Jul 4, 2024Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆19Apr 9, 2025Updated 10 months ago
- Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM☆79Apr 19, 2025Updated 9 months ago
- (TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection"☆23Mar 14, 2025Updated 11 months ago
- ☆18Nov 15, 2024Updated last year
- 【NeurIPS 2024】Official implementation of "Visual Fourier Prompt Tuning"☆39Jan 17, 2025Updated last year
- CatMAE☆14Dec 13, 2023Updated 2 years ago
- Official implementation of the paper "ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection…☆26Feb 13, 2024Updated 2 years ago
- Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).☆159Sep 27, 2024Updated last year
- [CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation☆156Aug 19, 2023Updated 2 years ago
- [NeurIPS 2024] SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow☆44Dec 1, 2024Updated last year
- ☆18Feb 8, 2026Updated last week
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆123Apr 26, 2024Updated last year
- This is the official implementation of "GvSeg: General and Task-Oriented Video Segmentation" (Accepted at ECCV 2024).☆18Jul 15, 2024Updated last year
- Taming Self-Training for Open-Vocabulary Object Detection, CVPR 2024☆21Dec 30, 2023Updated 2 years ago
- Generate animated visualizations for optical flow fields☆18Mar 12, 2019Updated 6 years ago
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆184Oct 25, 2023Updated 2 years ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆138Oct 8, 2024Updated last year
- Official Datasets and Implementation from our Paper "Video Class Agnostic Segmentation in Autonomous Driving".☆30Apr 19, 2025Updated 9 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆186May 21, 2025Updated 8 months ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆27Oct 13, 2024Updated last year