EgoAlpha / Awesome-Egocentric
☆46Updated last year
Alternatives and similar repositories for Awesome-Egocentric
Users who are interested in Awesome-Egocentric are comparing it to the repositories listed below
- This repository is a collection of research papers on World Models.☆39Updated last year
- DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments☆22Updated 3 months ago
- [CVPR2024] This is the official implementation of MP5☆103Updated last year
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆66Updated 9 months ago
- Codebase of ACL 2023 Findings "Aerial Vision-and-Dialog Navigation"☆52Updated 8 months ago
- ☆19Updated 11 months ago
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting☆24Updated 7 months ago
- Official implementation of the NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation"☆33Updated last year
- 🔀 Visual Room Rearrangement☆118Updated last year
- Official codebase for EmbCLIP☆126Updated 2 years ago
- Official implementation of the NRNS paper☆36Updated 3 years ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆146Updated last week
- ☆83Updated 2 years ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆57Updated 9 months ago
- Latest Advances on Embodied Multimodal LLMs (or Vision-Language-Action Models).☆116Updated last year
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆130Updated 8 months ago
- Code for Stable Control Representations☆25Updated 3 months ago
- Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆98Updated last week
- ☆44Updated 3 years ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆136Updated last month
- Official implementation of: Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel☆27Updated last month
- HAZARD challenge☆36Updated 2 months ago
- ☆76Updated 10 months ago
- Code for Reinforcement Learning from Vision Language Foundation Model Feedback☆114Updated last year
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation☆120Updated 2 weeks ago
- Official code release of AAAI 2024 paper SayCanPay.☆49Updated last year
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆135Updated last year
- Repository for Offline Visual Representation Learning v1 and v2☆13Updated 2 years ago
- [ICCV'23] Learning Vision-and-Language Navigation from YouTube Videos☆57Updated 6 months ago
- [ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆82Updated last month