HAWLYQ/ET-Cap

☆24

Alternatives and similar repositories for ET-Cap

Users interested in ET-Cap are comparing it to the repositories listed below.

CSir1996 / VLN-GELA
Official implementation of "Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation" (ICCV 2023 Oral)
☆21 · Oct 21, 2023 · Updated 2 years ago
Hoyyyaard / NavGPT
☆10 · Nov 16, 2023 · Updated 2 years ago
jialuli-luka / EnvEdit
PyTorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)
☆30 · Aug 2, 2022 · Updated 3 years ago
wz0919 / VLN-SRDF
Official implementation of "Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel"
☆35 · Jun 10, 2025 · Updated last year
YanyuanQiao / MiC
Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"
☆26 · May 22, 2024 · Updated 2 years ago
HLR / VLN-trans
[ACL2023] Official code repository for VLN-Trans
☆14 · Sep 10, 2023 · Updated 2 years ago
cshizhe / onav_rim
☆46 · Sep 30, 2023 · Updated 2 years ago
vlc-robot / robot_sugar
Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).
☆46 · Jun 19, 2025 · Updated last year
MrZihan / Sim2Real-VLN-3DFF
Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).
☆79 · Dec 26, 2025 · Updated 6 months ago
zehao-wang / LAD
Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding [AAAI 23].
☆16 · Apr 13, 2023 · Updated 3 years ago
jialuli-luka / VLN-SIG
☆35 · Aug 19, 2023 · Updated 2 years ago
wz0919 / waypoint-predictor
Training code of waypoint predictor in Discrete-to-Continuous VLN.
☆32 · Mar 25, 2024 · Updated 2 years ago
YicongHong / Ego2Map-NaViT
Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023)
☆28 · Jul 30, 2023 · Updated 2 years ago
iSEE-Laboratory / VLN-PRET
☆23 · Oct 19, 2024 · Updated last year
PeihaoChen / WS-MGMap
Official PyTorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…
☆35 · Apr 23, 2023 · Updated 3 years ago
vlc-robot / hiveformer
☆33 · Sep 25, 2024 · Updated last year
HAWLYQ / Qc-TextCap
☆16 · Dec 25, 2021 · Updated 4 years ago
cshizhe / VLN-HAMT
Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).
☆146 · Jun 14, 2023 · Updated 3 years ago
YicongHong / Recurrent-VLN-BERT
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
☆208 · Aug 13, 2022 · Updated 3 years ago
CrystalSixone / VLN-MAGIC
Official repository for MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation Learning towards Efficient Vision-and-La…
☆17 · May 17, 2026 · Updated 2 months ago
jialuli-luka / Video-MSG
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization
☆28 · Apr 14, 2025 · Updated last year
jacobkrantz / Sim2Sim-VLNCE
Official implementation of the ECCV 2022 Oral paper: Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments
☆35 · Dec 16, 2023 · Updated 2 years ago
vlc-robot / polarnet
[CoRL2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation
☆43 · Jun 4, 2024 · Updated 2 years ago
YanyuanQiao / HOP-VLN
Code of the CVPR 2022 paper "HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation"
☆31 · Aug 21, 2023 · Updated 2 years ago
JeremyLinky / YouTube-VLN
[ICCV'23] Learning Vision-and-Language Navigation from YouTube Videos
☆71 · Dec 27, 2024 · Updated last year
ggeorgak11 / CM2
☆59 · Apr 1, 2022 · Updated 4 years ago
MrZihan / HNR-VLN
Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…
☆109 · Apr 2, 2025 · Updated last year
Li-ChangHao / CoNav
☆12 · Jul 16, 2024 · Updated 2 years ago
YicongHong / Discrete-Continuous-VLN
Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language N…
☆155 · Oct 31, 2023 · Updated 2 years ago
lixyresearch / KERM
Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23)
☆45 · Aug 6, 2024 · Updated last year
MrZihan / GridMM
Official implementation of GridMM: Grid Memory Map for Vision-and-Language Navigation (ICCV'23).
☆105 · Apr 18, 2024 · Updated 2 years ago
MrZihan / Image2Sim
Official implementation of "Image2Sim: Scaling Embodied Navigation via Generative Neural Simulator"
☆15 · Updated this week
bareblackfoot / Object2HabitatMap
Awesome habitat top down map work 🤩
☆35 · Apr 7, 2024 · Updated 2 years ago
expectorlin / CONSOLE
Code of the paper "Correctable Landmark Discovery via Large Models for Vision-Language Navigation" (TPAMI 2024)
☆16 · Jun 7, 2024 · Updated 2 years ago
cshizhe / VLN-DUET
Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).
☆282 · Jun 27, 2023 · Updated 3 years ago
MrZihan / NavRAG
Official implementation of "NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM" (ACL'25 …
☆60 · Mar 6, 2025 · Updated last year
hekj / FDA
Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)
☆14 · Jan 8, 2024 · Updated 2 years ago
zhaoc5 / Grounding-REVERIE-Challenge
Official REVERIE Grounding Model of REVERIE Challenge @ CSIG 2022
☆19 · Oct 17, 2022 · Updated 3 years ago
intelligolabs / Le-RNR-Map
[ICCV 23] Official repository for Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language
☆17 · Dec 3, 2024 · Updated last year