Kelym / FASTLinks

Code for "Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation"

☆62

Alternatives and similar repositories for FAST

Users that are interested in FAST are comparing it to the libraries listed below

Sorting:

chihyaoma / selfmonitoring-agent
PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
☆122Updated 2 years ago
airsplay / R2R-EnvDrop
PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"
☆144Updated 4 years ago
ronghanghu / speaker_follower
Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.
☆139Updated 3 years ago
weituo12321 / PREVALENT
large scale pretrain for navigation task
☆93Updated 2 years ago
chihyaoma / regretful-agent
PyTorch code for CVPR 2019 paper: The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation
☆125Updated 2 years ago
HanqingWangAI / Active_VLN
The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`
☆43Updated 3 years ago
danielgordon10 / thor-iqa-cvpr-2018
Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"
☆126Updated 5 years ago
mmurray / cvdn
Cooperative Vision-and-Dialog Navigation
☆72Updated 3 years ago
allenai / savn
Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)
☆194Updated 3 weeks ago
Sha-Lab / babywalk
PyTorch code for the ACL 2020 paper: "BabyWalk: Going Farther in Vision-and-Language Navigationby Taking Baby Steps"
☆42Updated 3 years ago
yeezhu / CMN.pytorch
PyTorch implementation of "Vision-Dialog Navigation by Exploring Cross-modal Memory", CVPR 2020.
☆19Updated 3 years ago
google-research / valan
Vision and Language Agent Navigation
☆82Updated 4 years ago
ZhuFengdaaa / MG-AuxRN
code of the paper "Vision-Language Navigation with Multi-granularity Observation and Auxiliary Reasoning Tasks"
☆23Updated 4 years ago
zilongzheng / visdial-gnn
PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations
☆42Updated 4 years ago
sibeiyang / sgmn
Graph-Structured Referring Expressions Reasoning in The Wild, In CVPR 2020, Oral.
☆116Updated 5 years ago
VegB / VLN-Transformer
Implementation of "Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation"
☆26Updated 4 years ago
HanqingWangAI / SSM-VLN
Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"
☆42Updated 4 years ago
yikang-li / FactorizableNet
Factorizable Net (Multi-GPU version): An Efficient Subgraph-based Framework for Scene Graph Generation
☆220Updated 6 years ago
lil-lab / touchdown
Cornell Touchdown natural language navigation and spatial reasoning dataset.
☆105Updated 5 years ago
KaihuaTang / VCTree-Scene-Graph-Generation
Code for the Scene Graph Generation part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contexts"
☆124Updated last year
YicongHong / Fine-Grained-R2R
Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation
☆53Updated 4 years ago
YicongHong / Entity-Graph-VLN
Code of the NeurIPS 2021 paper: Language and Visual Entity Relationship Graph for Agent Navigation
☆46Updated 4 years ago
zhangybzbo / EnvBiasVLN
Feature resources of "Diagnosing the Environment Bias in Vision-and-Language Navigation"
☆16Updated 5 years ago
YuankaiQi / REVERIE
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
☆147Updated 2 years ago
DmZhukov / CrossTask
☆93Updated 3 years ago
jxwuyi / HouseNavAgent
Navigation agent with Bayesian relational memory in the House3D environment
☆30Updated 6 years ago
Cold-Winter / vqs
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
☆23Updated 8 years ago
lil-lab / ciff
Cornell Instruction Following Framework
☆34Updated 4 years ago
arjunmajum / vln-bert
Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)
☆59Updated 3 years ago
debadeepta / vnla
Code accompanying the CVPR 2019 paper: https://arxiv.org/abs/1812.04155
☆61Updated 3 years ago