FanScy/BEVInstructor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FanScy/BEVInstructor)

FanScy / BEVInstructor

[ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models

☆31

Alternatives and similar repositories for BEVInstructor

Users that are interested in BEVInstructor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

iSEE-Laboratory / VLN-PRET
View on GitHub
☆23Oct 19, 2024Updated last year
MrZihan / Sim2Real-VLN-3DFF
View on GitHub
Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).
☆79Dec 26, 2025Updated 6 months ago
CrystalSixone / VLN-MAGIC
View on GitHub
This is the official repository for MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation Learning towards Efficient Vision-and-La…
☆17May 17, 2026Updated 2 months ago
refkxh / C-Instructor
View on GitHub
[ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting
☆31Dec 16, 2024Updated last year
DefaultRui / BEV-Scene-Graph
View on GitHub
[ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation
☆125Apr 12, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
intelligolabs / Le-RNR-Map
View on GitHub
[ICCV 23] Official repository for Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language
☆17Dec 3, 2024Updated last year
DefaultRui / VLN-VER
View on GitHub
[CVPR24] Volumetric Environment Representation for Vision-Language Navigation
☆143Sep 9, 2024Updated last year
MrZihan / HNR-VLN
View on GitHub
Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…
☆109Apr 2, 2025Updated last year
dyh127 / S2VNet
View on GitHub
This is the official implementation of "Clustering Propagation for Universal Medical Image Segmentation" (Accepted at CVPR 2024).
☆42Apr 11, 2024Updated 2 years ago
guikunchen / FEC
View on GitHub
[CVPR'24] Neural Clustering based Visual Representation Learning
☆44Oct 6, 2025Updated 9 months ago
wz0919 / VLN-SRDF
View on GitHub
Official implementation of: Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
☆35Jun 10, 2025Updated last year
Hamoon1987 / meshConfidence
View on GitHub
Adding confidence to the SPIN mesh.
☆13Jun 25, 2023Updated 3 years ago
kaist-ami / LaughTalk
View on GitHub
☆14Dec 8, 2025Updated 7 months ago
kagawa588 / GvSeg
View on GitHub
This is the official implementation of "GvSeg: General and Task-Oriented Video Segmentation" (Accepted at ECCV 2024).
☆18Jul 15, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lingorX / LogicSeg
View on GitHub
(ICCV23 Oral) LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and Reasoning
☆24Apr 11, 2024Updated 2 years ago
meera1hahn / Graph_LED
View on GitHub
Localization via embodied dialog on the navigation graph
☆15Apr 18, 2022Updated 4 years ago
Feliciaxyao / NavMorph
View on GitHub
Official implementation of NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments (ICCV'25).
☆87Dec 26, 2025Updated 6 months ago
leonnnop / Locater
View on GitHub
[TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation
☆47Jan 20, 2024Updated 2 years ago
LYX0501 / InstructNav
View on GitHub
☆209Mar 29, 2025Updated last year
z-x-yang / DoraemonGPT
View on GitHub
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
☆91Jun 19, 2026Updated last month
CrystalSixone / VLN-GOAT
View on GitHub
Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)
☆103Jun 4, 2025Updated last year
yifeisu / TG-GAT
View on GitHub
Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation, AVDN Challenge, ICCV CLVL 2023.
☆21Jan 2, 2024Updated 2 years ago
wxh1996 / LANA-VLN
View on GitHub
Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"
☆95Apr 27, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
LirongWu / RFA-GNN
View on GitHub
Code for TNNLS paper "Beyond Homophily and Homogeneity Assumption: Relation-based Frequency Adaptive Graph Neural Networks"
☆14Feb 27, 2024Updated 2 years ago
MapItAnywhere / MapItAnywhere
View on GitHub
Source Code for "Map It Anywhere (MIA): Empowering Bird’s Eye View Mapping using Large-scale Public Data"
☆101Dec 8, 2024Updated last year
MrZihan / GridMM
View on GitHub
Official implementation of GridMM: Grid Memory Map for Vision-and-Language Navigation (ICCV'23).
☆106Apr 18, 2024Updated 2 years ago
HanqingWangAI / CCC-VLN
View on GitHub
Repository of our accepted CVPR2022 paper "Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-La…
☆28Mar 4, 2022Updated 4 years ago
lpercc / HA3D_simulator
View on GitHub
Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…
☆58Dec 20, 2024Updated last year
ZZWENG / DAPA_release
View on GitHub
Repository for 3DV2022 paper "Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh Recovery"
☆19Mar 22, 2023Updated 3 years ago
chenhaomingbob / CSC
View on GitHub
[CVPR 2024] This is official implementation of our CVPR 2024 paper "Building a Strong Pre-Training Baseline for Universal 3D Large-Scale …
☆17Jun 11, 2024Updated 2 years ago
lixyresearch / KERM
View on GitHub
Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23）
☆45Aug 6, 2024Updated last year
xyz9911 / FLAME
View on GitHub
[AAAI-25 Oral] Official Implementation of "FLAME: Learning to Navigate with Multimodal LLM in Urban Environments"
☆68Nov 2, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
chen-wl20 / SceneCompleter
View on GitHub
SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis
☆36Jun 13, 2025Updated last year
Wangdai-0800 / CoordinateAttention_Keras
View on GitHub
A Keras Implementation of Coordinate Attention follows https://github.com/Andrew-Qibin/CoordAttention
☆13Sep 25, 2021Updated 4 years ago
ethz-mrl / GloPro
View on GitHub
Globally Consistent Probabilistic Human Motion Estimation
☆23Feb 28, 2023Updated 3 years ago
hongsukchoi / HandNeRF_RELEASE
View on GitHub
Pytorch Implementation of "HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image", In ICRA 2024
☆27Mar 27, 2024Updated 2 years ago
Yevkuzn / CoMoDA
View on GitHub
☆13Feb 12, 2021Updated 5 years ago
weijianan1 / NVI
View on GitHub
[ECCV2024] Nonverbal Interaction Detection
☆31Oct 30, 2024Updated last year
arjung128 / stretch-open
View on GitHub
☆21May 7, 2025Updated last year