H-EmbodVis / NAUTILUSLinks
[NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding
☆350Updated last month
Alternatives and similar repositories for NAUTILUS
Users that are interested in NAUTILUS are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆215Updated 3 months ago
- (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation☆274Updated 5 months ago
- The summary of code and paper for unified model towards context-dependent (CD) concept segmentation.☆119Updated 5 months ago
- [NeurIPS 2025 (D&B)] Rethinking Evaluation of Infrared Small Target Detection☆352Updated 3 months ago
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant"☆66Updated last month
- ☆316Updated 3 months ago
- (NeurIPS 2025) UniMRSeg: Unified Modality-Relax Segmentation via Hierarchical Self-Supervised Compensation☆175Updated 2 months ago
- ☆69Updated 5 months ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆518Updated 6 months ago
- Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dep…☆574Updated 5 months ago
- PySegMetrics (PSM): A Python-based Simple yet Efficient Evaluation Toolbox for Segmentation-like tasks☆122Updated last year
- [Accepted by Information Fusion] Official code of the paper "Relational Representation Learning Network for Cross-Spectral Image Patch Ma…☆33Updated 4 months ago
- (TIP 2022) Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction☆110Updated 10 months ago
- This is the project for the paper of "Boosting Image Restoration via Priors from Pre-trained Models" in CVPR2024☆95Updated 7 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"☆199Updated 3 years ago
- Practical New Tasks and Inspiring Modeling Solutions for Diverse Open Vision Problems☆139Updated 3 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆480Updated last week
- [CVPR 2025 Highlight] Official Implementation of SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity☆117Updated last month
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm…☆378Updated last month
- ☆207Updated 8 months ago
- The official repository for ArGue: Attribute-Guided Prompt Tuning For Vision-Language Models☆141Updated last year
- 「ICCV25 highlight」 Official implementation of “Feature Purification Matters: Suppressing Outlier Propagation for Training-Free Open-Vocab…☆48Updated last month
- (IJCV 2024 & ACM MM 2021 Oral) Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation☆119Updated 3 years ago
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution☆356Updated last month
- WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving☆159Updated last month
- Official code of the paper "Why and How: Knowledge-Guided Learning for Cross-Spectral Image Patch Matching"☆43Updated 11 months ago
- [AAAI 2024] Official code for Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation☆62Updated last year
- (CVPR 2024 & arXiv 2025) Power Battery Detection☆310Updated 4 months ago
- Official Code for “Unveiling Hidden Details: A RAW Data-Enhanced Paradigm for Real-World Super-Resolution”☆151Updated last year
- CAASR: A Real-World Animation Super-Resolution Benchmark with Color Degradation and Multi-Scale Multi-Frequency Alignment☆97Updated 5 months ago