Observation Driven Memory Synergistic Planning for Continuous Vision-Language Navigation
☆29Jun 14, 2024Updated last year
Alternatives and similar repositories for MossVLN
Users that are interested in MossVLN are comparing it to the libraries listed below
- Multigranularity Contrastive cross-modal collaborative Generation (MCG) model for Video QA☆11Dec 13, 2023Updated 2 years ago
- Adapter-Enhanced Hierarchical Cross-Modal Pre-training for Lightweight Medical Report Generation☆13Jan 25, 2025Updated last year
- Consistency Conditioned Memory Augmented Dynamic Diagnosis Model for Medical Visual Question Answering☆13Jan 12, 2024Updated 2 years ago
- A consistent Med-VQA dataset, C-SLAKE, extended by Slake for further consistency assessment.☆13Jan 12, 2024Updated 2 years ago
- The official implementation of "Surface Depth Estimation from Multi-view Stereo Satellite Images with Distribution Contrast Network"☆10May 16, 2025Updated 10 months ago
- Official implementation of "Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation" (ICCV 2023 Oral)☆20Oct 21, 2023Updated 2 years ago
- ☆64Mar 10, 2026Updated last week
- This is the official repository for VLN-CLASH.☆24Aug 5, 2025Updated 7 months ago
- ☆13Oct 15, 2025Updated 5 months ago
- ☆17Jul 21, 2022Updated 3 years ago
- Path planning and Navigation☆14Nov 17, 2024Updated last year
- ☆22Oct 19, 2024Updated last year
- [CVPR Workshop 2025 - OpenSun3D] ForesightNav: Learning Scene Imagination for Efficient Exploration☆70Apr 23, 2025Updated 10 months ago
- Official implementation of Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation (CVPR'22 Oral).☆260Jun 27, 2023Updated 2 years ago
- [TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"☆427Apr 5, 2025Updated 11 months ago
- Repository of our accepted CVPR2022 paper "Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-La…☆28Mar 4, 2022Updated 4 years ago
- [NeurIPS 2024] Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features☆24Mar 20, 2025Updated last year
- N-EPIC-Kitchens: The event-based camera extension of the large-scale EPIC-Kitchens dataset.☆23May 10, 2022Updated 3 years ago
- SLAM homework based on LVI-SAM, adding BoW3D and Scan Context loop closure detection modules.☆16Mar 20, 2024Updated 2 years ago
- [NeurIPS 2024] Official implementation of the paper "Point-PRC: A Prompt Learning Based Regulation Framework for Generalizable Point Clou…☆17Mar 13, 2026Updated last week
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆17Nov 11, 2025Updated 4 months ago
- [ACMMM 2025] "Casual3DHDR: High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos"☆29Sep 26, 2025Updated 5 months ago
- [CoRL 2025] CogniPlan: Uncertainty-Guided Path Planning with Conditional Generative Layout Prediction - Public code and model☆45Jan 30, 2026Updated last month
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning☆44Updated this week
- [CVPR 2026] Official implementation of "ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models"☆71Feb 28, 2026Updated 3 weeks ago
- A Layered Memory Network for MovieQA☆16Apr 27, 2018Updated 7 years ago
- The official repository of the first version of ACE-Brain foundation model.☆62Mar 13, 2026Updated last week
- SHS-Net: Learning Signed Hyper Surfaces for Oriented Normal Estimation of Point Clouds☆26Nov 26, 2024Updated last year
- ☆18May 7, 2022Updated 3 years ago
- 📚 2025 Scene Graph ArXiv Paper List — Updated Daily☆15Feb 25, 2026Updated 3 weeks ago
- ☆18May 7, 2025Updated 10 months ago
- Official implementation of: Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel☆35Jun 10, 2025Updated 9 months ago
- Generate Potree compatible LOD data from 3D point clouds on the GPU using CUDA☆16Oct 6, 2023Updated 2 years ago
- The source code of PRA-Net.☆27Oct 4, 2021Updated 4 years ago
- [ECCV 2024] LiDAR-Event Stereo Fusion with Hallucinations☆21Jun 4, 2025Updated 9 months ago
- [RSS 2024 & RSS 2025] VLN-CE evaluation code of NaVid and Uni-NaVid☆381Oct 15, 2025Updated 5 months ago
- [CVPR24] Volumetric Environment Representation for Vision-Language Navigation☆137Sep 9, 2024Updated last year
- Official code for "Temporal Event Stereo via Joint Learning with Stereoscopic Flow" (ECCV2024)☆20Oct 1, 2024Updated last year
- The official implementation of "Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer"☆19Dec 9, 2025Updated 3 months ago