H-EmbodVis / NAUTILUS
[NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding
☆57Updated this week
Alternatives and similar repositories for NAUTILUS
Users who are interested in NAUTILUS are comparing it to the repositories listed below
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆120Updated last week
- 【ICME2025 Oral】 Official PyTorch Code for "Learning Dual-Domain Multi-Scale Representations for Single Image Deraining"☆15Updated 7 months ago
- [Arxiv 2024] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation☆70Updated last year
- Official implementation of "Enhancing Close-up Novel View Synthesis via Pseudo-labeling" [AAAI 2025]☆15Updated 6 months ago
- [AAAI 2024] Official code for Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation☆61Updated 9 months ago
- [AAAI 2025] Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video☆108Updated 4 months ago
- (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation☆116Updated 2 months ago
- A Strong Tracking Framework for 3D SOT on LiDAR Point Clouds☆79Updated 5 months ago
- ☆285Updated 3 weeks ago
- [ACMMM 2025] Official implementation of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompti…☆208Updated 6 months ago
- Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human …☆358Updated last week
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆283Updated last year
- LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences☆134Updated 3 weeks ago
- Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation☆116Updated 2 months ago
- ☆94Updated last year
- A Unified Driving World Model for Future Generation and Perception☆121Updated 3 months ago
- ☆89Updated last year
- [ICCV2025] II-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting☆147Updated 2 weeks ago
- [Accepted by ICCV2025] Official code of the paper "From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target De…☆185Updated last week
- Wan2.1 with Controlnet☆178Updated 7 months ago
- Code for paper "RealSR-R1: Reinforcement Learning for Real-World Image Super-Resolution with Vision-Language Chain-of-Thought"☆93Updated 4 months ago
- [ICRA 2025] AVD2: Accident Video Diffusion for Accident Video Description☆89Updated 5 months ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling☆82Updated 8 months ago
- (ICML 2024) Spider: A Unified Framework for Context-dependent Concept Segmentation☆350Updated 7 months ago
- [Accepted by TGRS2025] Official code of the paper "Multi-Scale Direction-Aware Network for Infrared Small Target Detection"☆80Updated 3 weeks ago
- ☆37Updated last year
- CoS: Chain-of-Shot Prompting for Long Video Understanding☆52Updated 8 months ago
- [ICCV 2025 Highlight] 🌟🌟🌟 Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts☆179Updated 3 months ago
- ☆67Updated 3 months ago
- The official project website of "ScaleKD: Strong Vision Transformers Could Be Excellent Teachers" (ScaleKD for short, accepted to NeurIPS…☆63Updated 9 months ago