[ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models
☆31Jul 16, 2024Updated last year
Alternatives and similar repositories for BEVInstructor
Users that are interested in BEVInstructor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Oct 19, 2024Updated last year
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆76Dec 26, 2025Updated 2 months ago
- This is the official repository for MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation Learning towards Efficient Vision-and-La…☆14Jun 6, 2024Updated last year
- Official implementation of NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments (ICCV'25).☆70Dec 26, 2025Updated 2 months ago
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting☆29Dec 16, 2024Updated last year
- [ICCV 23] Official repository for Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language☆17Dec 3, 2024Updated last year
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆124Apr 12, 2024Updated last year
- Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…☆105Apr 2, 2025Updated 11 months ago
- Adding confidence to the SPIN mesh.☆13Jun 25, 2023Updated 2 years ago
- This is the official implementation of "Clustering Propagation for Universal Medical Image Segmentation" (Accepted at CVPR 2024).☆42Apr 11, 2024Updated last year
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…☆54Dec 20, 2024Updated last year
- ☆14Dec 8, 2025Updated 3 months ago
- [CVPR'24] Neural Clustering based Visual Representation Learning☆44Oct 6, 2025Updated 5 months ago
- This is the official implementation of "GvSeg: General and Task-Oriented Video Segmentation" (Accepted at ECCV 2024).☆18Jul 15, 2024Updated last year
- Project Page for GaussianFormer☆24May 30, 2024Updated last year
- [ACL2023] Official code repository for VLN-Trans☆14Sep 10, 2023Updated 2 years ago
- Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)☆100Jun 4, 2025Updated 9 months ago
- ☆200Mar 29, 2025Updated 11 months ago
- Code for TNNLS paper "Beyond Homophily and Homogeneity Assumption: Relation-based Frequency Adaptive Graph Neural Networks"☆14Feb 27, 2024Updated 2 years ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆47Jan 20, 2024Updated 2 years ago
- Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation, AVDN Challenge, ICCV CLVL 2023.☆21Jan 2, 2024Updated 2 years ago
- TCSVT'26 & ICASSP'24☆16Mar 15, 2026Updated last week
- Source Code for "Map It Anywhere (MIA): Empowering Bird’s Eye View Mapping using Large-scale Public Data"☆93Dec 8, 2024Updated last year
- Official implementation of: Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel☆35Jun 10, 2025Updated 9 months ago
- Pytorch implementation of "DAMA - Multiplexed Immunofluorescence Brain Image Analysis Using Self-Supervised Dual-Loss Adaptive Masked Aut…☆17Oct 20, 2023Updated 2 years ago
- Repository for 3DV2022 paper "Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh Recovery"☆19Mar 22, 2023Updated 3 years ago
- [CVPR 2024] This is official implementation of our CVPR 2024 paper "Building a Strong Pre-Training Baseline for Universal 3D Large-Scale …☆17Jun 11, 2024Updated last year
- Official implementation of GridMM: Grid Memory Map for Vision-and-Language Navigation (ICCV'23).☆102Apr 18, 2024Updated last year
- A Keras Implementation of Coordinate Attention follows https://github.com/Andrew-Qibin/CoordAttention☆13Sep 25, 2021Updated 4 years ago
- [TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"☆427Apr 5, 2025Updated 11 months ago
- [AAAI-25 Oral] Official Implementation of "FLAME: Learning to Navigate with Multimodal LLM in Urban Environments"☆68Nov 2, 2025Updated 4 months ago
- ☆19Jun 26, 2024Updated last year
- Globally Consistent Probabilistic Human Motion Estimation☆23Feb 28, 2023Updated 3 years ago
- Repository of our accepted CVPR2022 paper "Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-La…☆28Mar 4, 2022Updated 4 years ago
- Pytorch Implementation of "HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image", In ICRA 2024☆26Mar 27, 2024Updated last year
- Deep adaptive hiding network for image hiding using attentive frequency extraction and gradual depth extraction☆17Aug 3, 2023Updated 2 years ago
- Code about the paper "Joint Contrastive Triple-learning for Deep Multi-view Clustering"☆15Sep 23, 2024Updated last year
- ☆20May 7, 2025Updated 10 months ago
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆45Jun 19, 2025Updated 9 months ago