☆52Jul 7, 2025Updated 9 months ago
Alternatives and similar repositories for DINO-R1
Users that are interested in DINO-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PCA: Progressive Cross-modal Association Learning for Unsupervised Visible-Infrared Person Re-Identification☆10Dec 23, 2025Updated 4 months ago
- Make Large Multimodal Models excel in object detection, ICCV 2025☆64Aug 1, 2025Updated 9 months ago
- DescribeEarth: Describe Anything for Remote Sensing Images☆26Mar 6, 2026Updated last month
- CoReVLA: A Dual-Stage End-to-End Autonomous Driving Framework for Long-Tail Scenarios via Collect-and-Refine☆32Feb 2, 2026Updated 3 months ago
- Data and code required to reach the main conclusions of the fastsmcg paper☆10Sep 19, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [CVPR 2026] ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks☆36Apr 9, 2026Updated 3 weeks ago
- 📚 2025 Scene Graph ArXiv Paper List — Updated Daily☆16Mar 18, 2026Updated last month
- Implementation for paper Automata Extraction from Transformers.☆12Jun 8, 2024Updated last year
- baseline mode for the ObjectNet competition☆18Jan 13, 2021Updated 5 years ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)☆239Aug 3, 2022Updated 3 years ago
- EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing☆24May 29, 2025Updated 11 months ago
- Low-rank adaptation of large language models (LoRA) for Segment Anything 2.☆18Oct 31, 2024Updated last year
- [ECCV 2024] The official implementation for "Embracing Events and Frames with Hierarchical Feature Refinement Network for Robust Object D…☆21Mar 24, 2025Updated last year
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"☆67Aug 9, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆41May 30, 2025Updated 11 months ago
- Official repository for CVPR 2024 paper "Advancing Saliency Ranking with Human Fixations: Dataset, Models and Benchmarks".☆21Jun 21, 2024Updated last year
- CoRL 2025☆48Sep 20, 2025Updated 7 months ago
- ☆17Dec 11, 2024Updated last year
- A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)☆64Apr 10, 2026Updated 3 weeks ago
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"☆90Dec 23, 2025Updated 4 months ago
- ☆13Aug 5, 2024Updated last year
- Official implementation for "SimA: Simple Softmax-free Attention for Vision Transformers"☆46Apr 18, 2024Updated 2 years ago
- RS Generate dataset☆18Jan 2, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CVPR 2026] DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning☆111Mar 21, 2026Updated last month
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆353Nov 6, 2025Updated 5 months ago
- [CVPR 2023] Better “CMOS” Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution☆10Mar 19, 2024Updated 2 years ago
- WBSR: Rethinking Imbalance in Image Super-Resolution for Efficient Inference☆13Oct 8, 2024Updated last year
- [IEEE SPM 2020] Collect some papers about event-based Autonomous Driving & Event-based Robotic-Grasping.☆18May 9, 2025Updated 11 months ago
- Does Diffusion Beat GAN in Image Super Resolution?☆12May 27, 2024Updated last year
- ☆13Nov 7, 2021Updated 4 years ago
- This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detectio…☆821Jul 27, 2025Updated 9 months ago
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation for "Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts"☆22Jun 28, 2025Updated 10 months ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"☆134Feb 4, 2026Updated 2 months ago
- ☆12Dec 9, 2022Updated 3 years ago
- [CVPR2023] Practical Network Acceleration with Tiny Sets☆14Jul 28, 2023Updated 2 years ago
- LITE: A Paradigm Shift: Multi Object Tracking with Deep Association Metric☆28Updated this week
- ☆14Mar 15, 2025Updated last year
- Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images☆51Aug 26, 2025Updated 8 months ago