snumprlab / capeam
Official Implementation of CAPEAM (ICCV'23)
☆9 · Updated 2 months ago
Related projects
Alternatives and complementary repositories for capeam
- Prompter for Embodied Instruction Following · ☆17 · Updated 11 months ago
- ☆33 · Updated 3 weeks ago
- Official Implementation of ReALFRED (ECCV'24) · ☆24 · Updated last month
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data · ☆36 · Updated last year
- ☆26 · Updated last month
- ☆61 · Updated last month
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation" · ☆80 · Updated last year
- ☆42 · Updated 2 years ago
- Official Implementation of CL-ALFRED (ICLR'24) · ☆18 · Updated 3 weeks ago
- ☆46 · Updated 2 months ago
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023) · ☆39 · Updated 3 months ago
- ☆80 · Updated last week
- ☆25 · Updated last year
- RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning · ☆16 · Updated last month
- ☆41 · Updated 7 months ago
- ☆77 · Updated 3 months ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024) · ☆30 · Updated 2 months ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy." · ☆25 · Updated last month
- 🐍 A Python Package for Seamless Data Distribution in AI Workflows · ☆21 · Updated 11 months ago
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos. · ☆28 · Updated last year
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll… · ☆37 · Updated 5 months ago
- ☆39 · Updated 6 months ago
- PyTorch implementation of the Hiveformer research paper · ☆47 · Updated last year
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022 · ☆49 · Updated 3 weeks ago
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering" · ☆35 · Updated 4 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning" · ☆37 · Updated 7 months ago
- Egocentric Video Understanding Dataset (EVUD) · ☆24 · Updated 4 months ago
- [ICCV'23] Learning Vision-and-Language Navigation from YouTube Videos · ☆41 · Updated last year
- Official codebase for EmbCLIP · ☆113 · Updated last year
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023) · ☆23 · Updated 10 months ago