hzcar / DUOLinks
Code for ICCV 2025 paper — Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts
☆80Updated 3 months ago
Alternatives and similar repositories for DUO
Users that are interested in DUO are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation☆112Updated last year
- A Unified Driving World Model for Future Generation and Perception☆128Updated 5 months ago
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆121Updated last year
- [CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding☆244Updated 6 months ago
- ☆95Updated last year
- [ACM CSUR 2025] Out-of-Distribution Detection: A Task-Oriented Survey of Recent Advances☆156Updated 4 months ago
- 🌐 Forging Spatial Intelligence: A Survey on Multi-Modal Pre-Training for Autonomous Systems☆22Updated this week
- Code for ICML 2025 paper — Beyond Entropy: Region Confidence Proxy for Wild Test-Time Adaptation☆59Updated 5 months ago
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D☆195Updated last week
- LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation (ICLR 2025)☆36Updated 10 months ago
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning☆226Updated 3 weeks ago
- 🔥 [AAAI 2026 Oral] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptat…☆72Updated last year
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆284Updated last year
- High Quality Video Reasoning Segmentation☆130Updated last month
- [ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives☆228Updated last week
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views☆107Updated 2 weeks ago
- [ACMMM 2025] Officially implement of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompti…☆211Updated 7 months ago
- Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"☆92Updated 2 years ago
- [CVPR2024] ERMVP: Communication-Efficient and Collaboration-Robust Multi-Vehicle Perception in Challenging Environments☆42Updated last year
- 🔥 The first open-sourced diffusion vision-langauge-action model.☆138Updated last week
- ☆140Updated 9 months ago
- This is the official implementation of UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving☆189Updated 3 months ago
- CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarch…☆252Updated 2 years ago
- [AAAI 2026] ✨ TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding☆108Updated last month
- (ICCV 2025) Enhance CLIP and MLLM's fine-grained visual representations with generative models.☆75Updated 5 months ago
- [IEEE TPAMI] Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning☆378Updated 3 weeks ago
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World☆133Updated this week
- [ICCV23] DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection☆83Updated last year
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆123Updated last month
- [Pattern Recognition 2025] Cross-Modal Adapter for Vision-Language Retrieval☆140Updated 4 months ago