ZhuFengdaaa/MG-AuxRN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZhuFengdaaa/MG-AuxRN)

ZhuFengdaaa / MG-AuxRN

code of the paper "Vision-Language Navigation with Multi-granularity Observation and Auxiliary Reasoning Tasks"

☆23

Alternatives and similar repositories for MG-AuxRN

Users that are interested in MG-AuxRN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yeezhu / CMN.pytorch
View on GitHub
PyTorch implementation of "Vision-Dialog Navigation by Exploring Cross-modal Memory", CVPR 2020.
☆19Nov 22, 2022Updated 3 years ago
arjunmajum / vln-bert
View on GitHub
Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)
☆59Oct 7, 2022Updated 3 years ago
YuankaiQi / REVERIE
View on GitHub
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
☆158May 15, 2026Updated 2 months ago
airsplay / R2R-EnvDrop
View on GitHub
PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"
☆146Oct 23, 2021Updated 4 years ago
Sha-Lab / babywalk
View on GitHub
PyTorch code for the ACL 2020 paper: "BabyWalk: Going Farther in Vision-and-Language Navigationby Taking Baby Steps"
☆42Apr 13, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HanqingWangAI / Active_VLN
View on GitHub
The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`
☆44Apr 9, 2022Updated 4 years ago
HanqingWangAI / SSM-VLN
View on GitHub
Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"
☆43Jul 31, 2021Updated 4 years ago
YicongHong / Entity-Graph-VLN
View on GitHub
Code of the NeurIPS 2021 paper: Language and Visual Entity Relationship Graph for Agent Navigation
☆47Oct 31, 2021Updated 4 years ago
alloldman / CKR
View on GitHub
☆23Dec 9, 2021Updated 4 years ago
airbert-vln / airbert
View on GitHub
Codebase for the Airbert paper
☆46Mar 20, 2023Updated 3 years ago
HanqingWangAI / VXN
View on GitHub
Repository of our accepted NeurIPS-2022 paper "Towards Versatile Embodied Navigation"
☆22Dec 8, 2022Updated 3 years ago
chenjinyubuaa / SEvol
View on GitHub
☆14Sep 21, 2022Updated 3 years ago
changlin31 / BossNAS
View on GitHub
(ICCV 2021) BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search
☆143Dec 6, 2021Updated 4 years ago
GT-RIPL / robo-vln
View on GitHub
Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"
☆89Jun 27, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
YuankaiQi / ORIST
View on GitHub
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
☆16Feb 7, 2022Updated 4 years ago
cshizhe / VLN-HAMT
View on GitHub
Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).
☆147Jun 14, 2023Updated 3 years ago
changlin31 / AutoProg
View on GitHub
(CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers
☆25Feb 26, 2025Updated last year
Theohhhu / UPDeT
View on GitHub
Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…
☆139Feb 3, 2021Updated 5 years ago
expectorlin / ADAPT
View on GitHub
code for the paper "ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts" (CVPR 2022)
☆10Jul 17, 2022Updated 4 years ago
peteanderson80 / Matterport3DSimulator
View on GitHub
AI Research Platform for Reinforcement Learning from Real Panoramic Images.
☆707Jul 12, 2024Updated 2 years ago
jialuli-luka / EnvEdit
View on GitHub
Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)
☆30Aug 2, 2022Updated 3 years ago
YicongHong / Recurrent-VLN-BERT
View on GitHub
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
☆209Aug 13, 2022Updated 3 years ago
changlin31 / DNA
View on GitHub
(CVPR 2020) Block-wisely Supervised Neural Architecture Search with Knowledge Distillation
☆234Sep 23, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ronghanghu / speaker_follower
View on GitHub
Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.
☆138Nov 22, 2022Updated 3 years ago
Augusta-A / Awesome-EfficientVideo
View on GitHub
☆12Sep 11, 2021Updated 4 years ago
CrystalSixone / DSRG
View on GitHub
Code for A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation
☆17Apr 25, 2024Updated 2 years ago
Bill1235813 / IVLN
View on GitHub
Implementation (R2R part) for the paper "Iterative Vision-and-Language Navigation"
☆18Apr 4, 2024Updated 2 years ago
Xiaodongsuper / M5Product_toolkit
View on GitHub
M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining CVPR 2022. Dataset toolkit
☆24Sep 27, 2021Updated 4 years ago
jialuli-luka / PanoGen
View on GitHub
Code and Data for Paper: PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
☆83May 31, 2023Updated 3 years ago
youthHan / HVRNet
View on GitHub
Code for Mining Inter-Video Proposal Relations for Video Object Detection, ECCV 2020
☆49Aug 19, 2022Updated 3 years ago
YicongHong / Discrete-Continuous-VLN
View on GitHub
Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language N…
☆156Oct 31, 2023Updated 2 years ago
chihyaoma / regretful-agent
View on GitHub
PyTorch code for CVPR 2019 paper: The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation
☆125Oct 3, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
cshizhe / VLN-DUET
View on GitHub
Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).
☆284Jun 27, 2023Updated 3 years ago
daqingliu / awesome-vln
View on GitHub
A curated list of research papers in Vision-Language Navigation (VLN)
☆237Apr 17, 2024Updated 2 years ago
PRS-Organization / PRS-Trial-Version
View on GitHub
Trial version for prs platform (python project). Please note that the complete experience requires downloading the Unity resource.
☆10Jun 26, 2024Updated 2 years ago
ADLab-AutoDrive / ICKD
View on GitHub
Offical Code for Paper "Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation"
☆17Jan 19, 2022Updated 4 years ago
feizc / DeeCap
View on GitHub
Dynamic Early Exit for Image Captioning
☆17Oct 25, 2022Updated 3 years ago
prlz77 / orthoreg
View on GitHub
Torch implementation of orthoreg.
☆15Oct 27, 2021Updated 4 years ago
vincentschen / limited-label-scene-graphs
View on GitHub
Scene Graph Prediction with Limited Labels
☆54Oct 3, 2023Updated 2 years ago