xiaomoguhz / OV-DQUO
[AAAI2025] Code Release of OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
☆35 · Dec 15, 2024 · Updated last year
Alternatives and similar repositories for OV-DQUO
Users that are interested in OV-DQUO are comparing it to the libraries listed below
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception ☆151 · Jan 10, 2026 · Updated last month
- (TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection" ☆23 · Mar 14, 2025 · Updated 11 months ago
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction" ☆90 · Dec 23, 2025 · Updated last month
- ☆23 · Aug 20, 2024 · Updated last year
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024) ☆29 · Jul 23, 2024 · Updated last year
- List of administrative divisions with standard area codes in Japan ☆11 · Mar 21, 2025 · Updated 10 months ago
- Road crack segmentation with UNet in PyTorch — Includes implementations of multiple loss functions such as Focal, Dice, and Dice + CE. ☆34 · Apr 15, 2025 · Updated 10 months ago
- Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection ☆13 · Mar 23, 2025 · Updated 10 months ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models" ☆84 · Jan 2, 2026 · Updated last month
- DisTime: Distribution-based Time Representation for Video Large Language Models. ☆18 · Jul 10, 2025 · Updated 7 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval ☆22 · Jun 23, 2025 · Updated 7 months ago
- please subscribe the channel ☆14 · Sep 21, 2020 · Updated 5 years ago
- Implementation of paper - RepVGG-GELAN: ENHANCED GELAN WITH VGG-STYLE CONVNETS FOR BRAIN TUMOR DETECTION ☆10 · Jul 19, 2025 · Updated 6 months ago
- ☆12 · Jan 2, 2025 · Updated last year
- ☆11 · Dec 6, 2024 · Updated last year
- A part of the VinDr Lab project. Which performs as the middleware layer between user interface and backend systems. ☆10 · May 20, 2022 · Updated 3 years ago
- [ICPR 2024] Official repository of the paper "GenFormer - Generated Images are All You Need to Improve Robustness of Transformers on Smal… ☆14 · Aug 30, 2024 · Updated last year
- ☆12 · Aug 19, 2023 · Updated 2 years ago
- Android video semantic segmentation using DeeplabV3+ lite ☆10 · Sep 20, 2019 · Updated 6 years ago
- Official code for paper 'FFE-CycleGAN: A specialized optimization method of CycleGAN for VIS-NIR Heterogeneous Face Recognition' ☆13 · Sep 23, 2021 · Updated 4 years ago
- Multi-Person Tracking in Tour Guide Robot ☆10 · Aug 23, 2022 · Updated 3 years ago
- ☆10 · Dec 5, 2020 · Updated 5 years ago
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation ☆13 · Apr 15, 2024 · Updated last year
- VarGFaceNet Pytorch Implementation with AdaFace for LRFR ☆10 · Aug 25, 2023 · Updated 2 years ago
- Weakly Supervised Referring Video Object Segmentation with Object-Centric Pseudo-Guidance ☆10 · Aug 17, 2024 · Updated last year
- [MICCAI 2024] RadiomicsFill-Mammo: Synthetic Mammogram Mass Manipulation with Radiomics Features ☆10 · Aug 22, 2025 · Updated 5 months ago
- [ICCV2023] DR-Tune: Improving Fine-tuning of Pretrained Visual Models by Distribution Regularization with Semantic Calibration ☆12 · Oct 12, 2023 · Updated 2 years ago
- ☆14 · Dec 2, 2025 · Updated 2 months ago
- Smart Agent Survey is an application that automates survey response generation by processing survey documents and creating multiple synth… ☆11 · Aug 30, 2025 · Updated 5 months ago
- Bridger is a portable microservice constructor framework that can be attached to all microservices. ☆13 · Jan 12, 2024 · Updated 2 years ago
- Hangul recognizer ☆10 · May 7, 2022 · Updated 3 years ago
- This is an official implementation of video classification for our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Atten… ☆12 · Jan 30, 2021 · Updated 5 years ago
- OW-OVD: Unified Open World and Open Vocabulary Object Detection (CVPR 2025) ☆23 · Dec 2, 2024 · Updated last year
- ☆22 · Jan 12, 2026 · Updated last month
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding. ☆11 · Jul 16, 2024 · Updated last year
- ☆11 · Jul 26, 2024 · Updated last year
- [ICRA 2024] GelRoller: A Rolling Vision-based Tactile Sensor for Large Surface Reconstruction Using Self-Supervised Photometric Stereo Me… ☆12 · Sep 23, 2024 · Updated last year
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr… ☆34 · Jul 3, 2025 · Updated 7 months ago
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration ☆15 · Nov 18, 2025 · Updated 2 months ago