(TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection"
☆26 · Mar 14, 2025 · Updated last year
Alternatives and similar repositories for HD-OVD
Users that are interested in HD-OVD are comparing it to the libraries listed below.
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024] ☆14 · Jul 11, 2024 · Updated last year
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024) ☆30 · Jul 23, 2024 · Updated last year
- ☆10 · Oct 25, 2024 · Updated last year
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use. ☆26 · Dec 30, 2024 · Updated last year
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space ☆103 · Mar 18, 2026 · Updated 3 weeks ago
- This is an official implementation of video classification for our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Atten… ☆12 · Jan 30, 2021 · Updated 5 years ago
- ☆25 · Jan 12, 2026 · Updated 3 months ago
- Official code for paper 'FFE-CycleGAN: A specialized optimization method of CycleGAN for VIS-NIR Heterogeneous Face Recognition' ☆13 · Sep 23, 2021 · Updated 4 years ago
- [AAAI 2025] Official Implementation of I-HallA v1.0 ☆13 · Feb 2, 2025 · Updated last year
- ☆25 · Dec 23, 2024 · Updated last year
- ☆27 · Oct 1, 2025 · Updated 6 months ago
- Automated Segmentation of Prohibited Items in X-ray Baggage Images Using Dense De-overlap Attention Snake, TMM 2022 ☆13 · Dec 28, 2022 · Updated 3 years ago
- Vue component for Plaid Link ☆10 · Sep 30, 2025 · Updated 6 months ago
- Official repository for Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments ☆16 · Jul 9, 2024 · Updated last year
- [ICLR 2025] Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding ☆41 · Mar 18, 2025 · Updated last year
- YOLOX implemented by pytorch lightning, a simpler expression of pytorch ☆11 · May 26, 2022 · Updated 3 years ago
- One-Shot Unsupervised Cross Domain Detection ☆13 · Nov 22, 2022 · Updated 3 years ago
- Multi-Person Tracking in Tour Guide Robot ☆10 · Aug 23, 2022 · Updated 3 years ago
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction" ☆90 · Dec 23, 2025 · Updated 3 months ago
- TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models ☆19 · Jan 2, 2025 · Updated last year
- ☆60 · Aug 12, 2024 · Updated last year
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models" ☆84 · Jan 2, 2026 · Updated 3 months ago
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin… ☆48 · Oct 19, 2025 · Updated 5 months ago
- [CVPR2025] Official implementation of RAM ☆29 · Nov 4, 2025 · Updated 5 months ago
- ☆12 · Nov 4, 2024 · Updated last year
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation" ☆46 · Mar 25, 2025 · Updated last year
- A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023 ☆200 · Apr 16, 2023 · Updated 3 years ago
- Official code for CAVIS: Context-Aware Video Instance Segmentation ☆111 · Sep 17, 2025 · Updated 6 months ago
- CVPR 2025' Instruct-4DGS: Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation ☆27 · Sep 21, 2025 · Updated 6 months ago
- ncnn deployment supporting common algorithms such as RTMDet, YOLOv8, YOLOX, and Faster R-CNN ☆13 · Mar 17, 2024 · Updated 2 years ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ☆202 · Feb 5, 2024 · Updated 2 years ago
- ☆11 · Mar 4, 2026 · Updated last month
- VarGFaceNet Pytorch Implementation with AdaFace for LRFR ☆10 · Aug 25, 2023 · Updated 2 years ago
- ☆18 · Nov 15, 2024 · Updated last year
- Source code for Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach (CVPR 2024) ☆28 · Dec 3, 2024 · Updated last year
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation ☆51 · Sep 24, 2024 · Updated last year
- Frames Extraction With OpenCV and Python ☆15 · Aug 26, 2020 · Updated 5 years ago
- This repository is the official implementation of our AAAI 2025 accepted paper: "PhysAug: A Physical-guided and Frequency-based Data Aug… ☆22 · May 16, 2025 · Updated 11 months ago
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory ☆19 · Apr 9, 2025 · Updated last year