xiaomoguhz / OV-DQUOView external linksLinks
[AAAI2025] Code Release of OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
☆35Dec 15, 2024Updated last year
Alternatives and similar repositories for OV-DQUO
Users that are interested in OV-DQUO are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior☆38Jan 31, 2026Updated 2 weeks ago
- ☆35Nov 25, 2025Updated 2 months ago
- (TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection"☆23Mar 14, 2025Updated 11 months ago
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"☆90Dec 23, 2025Updated last month
- ☆23Aug 20, 2024Updated last year
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)☆29Jul 23, 2024Updated last year
- List of administrative divisions with standard area codes in Japan☆11Mar 21, 2025Updated 10 months ago
- Road crack segmentation with UNet in PyTorch — Includes implementations of multiple loss functions such as Focal, Dice, and Dice + CE.☆34Apr 15, 2025Updated 10 months ago
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆215Apr 3, 2025Updated 10 months ago
- Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection☆13Mar 23, 2025Updated 10 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- This repository contains the notebooks for texture classification.☆11May 17, 2022Updated 3 years ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"☆84Jan 2, 2026Updated last month
- A part of the VinDr Lab project. Which performs as the middleware layer between user interface and backend systems.☆10May 20, 2022Updated 3 years ago
- ☆12Jan 2, 2025Updated last year
- OpenLR library for Python☆15Jul 14, 2025Updated 7 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆22Jun 23, 2025Updated 7 months ago
- ☆11Dec 6, 2024Updated last year
- Implementation of paper - RepVGG-GELAN: ENHANCED GELAN WITH VGG-STYLE CONVNETS FOR BRAIN TUMOR DETECTION☆10Jul 19, 2025Updated 6 months ago
- [ICPR 2024] Official repository of the paper "GenFormer - Generated Images are All You Need to Improve Robustness of Transformers on Smal…☆14Aug 30, 2024Updated last year
- Tiny configuration for Triton Inference Server☆45Jan 10, 2025Updated last year
- Android video semantic segmentation using DeeplabV3+ lite☆10Sep 20, 2019Updated 6 years ago
- Smart Agent Survey is an application that automates survey response generation by processing survey documents and creating multiple synth…☆11Aug 30, 2025Updated 5 months ago
- [MICCAI 2024] RadiomicsFill-Mammo: Synthetic Mammogram Mass Manipulation with Radiomics Features☆10Aug 22, 2025Updated 5 months ago
- ☆22Jan 12, 2026Updated last month
- [ICRA 2024] GelRoller: A Rolling Vision-based Tactile Sensor for Large Surface Reconstruction Using Self-Supervised Photometric Stereo Me…☆12Sep 23, 2024Updated last year
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 2 months ago
- OW-OVD: Unified Open World and Open Vocabulary Object Detection (CVPR 2025)☆23Dec 2, 2024Updated last year
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆19Nov 28, 2025Updated 2 months ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆34Jul 3, 2025Updated 7 months ago
- Weakly Supervised Referring Video Object Segmentation with Object-Centric Pseudo-Guidance☆10Aug 17, 2024Updated last year
- Additional widgets for the Argos project☆11Aug 28, 2021Updated 4 years ago
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆13Apr 15, 2024Updated last year
- You can track summary covid-19 data for a general or specific country. This application was developed with Vue.js.☆10Jun 23, 2021Updated 4 years ago
- ☆12Aug 19, 2023Updated 2 years ago
- ☆10Dec 5, 2020Updated 5 years ago
- ☆11Jul 26, 2024Updated last year
- This is an official implementation of video classification for our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Atten…☆12Jan 30, 2021Updated 5 years ago
- SPLINE-Net: Sparse Photometric Stereo through Lighting Interpolation and Normal Estimation Networks☆11Apr 13, 2023Updated 2 years ago