(TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection"
☆25Mar 14, 2025Updated last year
Alternatives and similar repositories for HD-OVD
Users that are interested in HD-OVD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Jul 11, 2024Updated last year
- The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading t…☆26Jan 20, 2026Updated 2 months ago
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)☆30Jul 23, 2024Updated last year
- ☆10Oct 25, 2024Updated last year
- ☆23Aug 20, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆26Dec 30, 2024Updated last year
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space☆102Mar 18, 2026Updated last week
- This is an official implementation of video classification for our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Atten…☆12Jan 30, 2021Updated 5 years ago
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆25Nov 28, 2025Updated 3 months ago
- Official code for paper 'FFE-CycleGAN: A specialized optimization method of CycleGAN for VIS-NIR Heterogeneous Face Recognition'☆13Sep 23, 2021Updated 4 years ago
- ☆25Dec 23, 2024Updated last year
- ☆26Oct 1, 2025Updated 5 months ago
- Automated Segmentation of Prohibited Items in X-ray Baggage Images Using Dense De-overlap Attention Snake, TMM 2022☆13Dec 28, 2022Updated 3 years ago
- [ICME2024, Official Code] for paper "Bringing Textual Prompt to AI-Generated Image Quality Assessment"☆21Jul 9, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Official repository for Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments☆16Jul 9, 2024Updated last year
- [ICLR 2025] Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding☆40Mar 18, 2025Updated last year
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆43Oct 19, 2025Updated 5 months ago
- One-Shot Unsupervised Cross Domain Detection☆13Nov 22, 2022Updated 3 years ago
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"☆90Dec 23, 2025Updated 3 months ago
- TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models☆19Jan 2, 2025Updated last year
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"☆84Jan 2, 2026Updated 2 months ago
- Official code for CAVIS: Context-Aware Video Instance Segmentation☆97Sep 17, 2025Updated 6 months ago
- [CVPR2025] Official implementation of RAM☆29Nov 4, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆12Nov 4, 2024Updated last year
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆45Mar 25, 2025Updated last year
- Code of ICLR 2025 paper "DynaPrompt: Dynamic Test-Time Prompt Tuning"☆22Jan 29, 2025Updated last year
- A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023☆199Apr 16, 2023Updated 2 years ago
- ☆10Nov 3, 2023Updated 2 years ago
- CVPR 2025' Instruct-4DGS: Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation☆26Sep 21, 2025Updated 6 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆201Feb 5, 2024Updated 2 years ago
- ☆18Nov 15, 2024Updated last year
- Source code for Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach (CVPR 2024)☆28Dec 3, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆16Dec 11, 2024Updated last year
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆50Sep 24, 2024Updated last year
- Frames Extraction With OpenCV and Python☆15Aug 26, 2020Updated 5 years ago
- This repository is the official implementation of our AAAI 2025 accepted paper: "PhysAug: A Physical-guided and Frequency-based Data Aug…☆22May 16, 2025Updated 10 months ago
- YoloV5sl_V4模型pruning☆13May 18, 2021Updated 4 years ago
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆19Apr 9, 2025Updated 11 months ago
- A PyTorch toolkit for 2D Human Pose Estimation.☆13Jan 4, 2019Updated 7 years ago