(TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection"
☆23Mar 14, 2025Updated 11 months ago
Alternatives and similar repositories for HD-OVD
Users who are interested in HD-OVD are comparing it to the repositories listed below.
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Jul 11, 2024Updated last year
- TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models☆19Jan 2, 2025Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆19Apr 9, 2025Updated 10 months ago
- CatMAE☆14Dec 13, 2023Updated 2 years ago
- [CVPR 2025] Instruct-4DGS: Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation☆25Sep 21, 2025Updated 5 months ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆27Oct 13, 2024Updated last year
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Mar 20, 2025Updated 11 months ago
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆42Oct 19, 2025Updated 4 months ago
- [ICLR 2025] Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding☆40Mar 18, 2025Updated 11 months ago
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆25Dec 30, 2024Updated last year
- [ICCV2023] Isomer: Isomerous Transformer for Zero-Shot Video Object Segmentation☆30Nov 21, 2023Updated 2 years ago
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)☆29Jul 23, 2024Updated last year
- [AAAI2025] Code Release of OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision☆35Dec 15, 2024Updated last year
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- Official PyTorch implementation of PiClick: Picking the desired mask in click-based interactive segmentation.☆26Jul 2, 2024Updated last year
- Segment This Thing is an efficient image segmentation model that uses biologically-inspired foveated tokenization to reduce inference …☆55Jun 16, 2025Updated 8 months ago
- Official Implementation of ECCV2024 paper: SLAck☆29Sep 18, 2024Updated last year
- TrackGPT: Track What You Need in Videos via Text Prompts☆25May 16, 2023Updated 2 years ago
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆64Jan 6, 2026Updated 2 months ago
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆32Jun 3, 2025Updated 9 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- [NeurIPS 2022] Segmenting Moving Objects via an Object-Centric Representation. Junyu Xie, Weidi Xie, Andrew Zisserman.☆32Dec 20, 2023Updated 2 years ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆34Mar 7, 2025Updated 11 months ago
- [CVPR 2024 Challenge] 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation☆32Oct 18, 2024Updated last year
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Reinforcement learning environment for the UR5e robot in an OpenAI Gym-like format. Includes both simulated and real-robot parts.☆14Nov 2, 2021Updated 4 years ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
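- AI question-answering system based on a personal knowledge base: SSE streaming/Agent/knowledge-graph RAG/FunctionCall/message history/image generation/image understanding/Embedding/VectorDatabase/RAG☆21Sep 7, 2025Updated 5 months ago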
- Automated Segmentation of Prohibited Items in X-ray Baggage Images Using Dense De-overlap Attention Snake, TMM 2022☆12Dec 28, 2022Updated 3 years ago
- MiniGPT-Pancreas: Multimodal Large language Model for Pancreas Cancer Classification and Detection☆11Sep 19, 2025Updated 5 months ago
- Official code for CAVIS: Context-Aware Video Instance Segmentation☆97Sep 17, 2025Updated 5 months ago
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"☆91Dec 23, 2025Updated 2 months ago