EgoAlpha / Awesome-EgocentricLinks
☆46Updated last year
Alternatives and similar repositories for Awesome-Egocentric
Users that are interested in Awesome-Egocentric are comparing it to the libraries listed below
Sorting:
- A curated list of research papers in Vision-Language Navigation (VLN)☆219Updated last year
- Codebase of ACL 2023 Findings "Aerial Vision-and-Dialog Navigation"☆52Updated 10 months ago
- Official codebase for EmbCLIP☆130Updated 2 years ago
- official implementation of NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation"☆34Updated last year
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting☆23Updated 8 months ago
- 🔀 Visual Room Rearrangement☆122Updated 2 years ago
- This repository is a collection of research papers on World Models.☆38Updated last year
- ☆26Updated 2 years ago
- Latest Advances on Embodied Multimodal LLMs (or Vison-Language-Action Models).☆120Updated last year
- Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation☆186Updated 3 years ago
- Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).☆126Updated 2 years ago
- Official implementation of the NRNS paper☆36Updated 3 years ago
- Official implementation of: Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel☆26Updated 2 months ago
- DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments☆21Updated 4 months ago
- ☆83Updated 2 years ago
- [ICCV'23] Learning Vision-and-Language Navigation from YouTube Videos☆60Updated 8 months ago
- A curated list about Awesome Embodied AI works and is still in construct. Now it contains a list of Simulators, Tasks and Datasets.☆31Updated 5 years ago
- Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)☆81Updated 3 months ago
- Ideas and thoughts about the fascinating Vision-and-Language Navigation☆252Updated 2 years ago
- ☆14Updated 3 years ago
- Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).☆204Updated 2 years ago
- Target journals and conferences in the field of robotics and computer vision.☆161Updated last year
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆130Updated 10 months ago
- Some experiences for new researchers to grow grow up☆42Updated 2 years ago
- Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"☆82Updated last year
- ☆19Updated last year
- ☆10Updated last year
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆144Updated last year
- Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23)☆44Updated last year
- ☆18Updated 2 years ago