facebookresearch / EgoTV
EgoTV Egocentric Task Verification from Natural Language Task Descriptions
☆27Updated last year
Alternatives and similar repositories for EgoTV:
Users that are interested in EgoTV are comparing it to the libraries listed below
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"☆83Updated last year
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.☆30Updated last year
- Codebase for HiP☆88Updated last year
- 🔀 Visual Room Rearrangement☆106Updated last year
- 🐍 A Python Package for Seamless Data Distribution in AI Workflows☆21Updated last year
- code for TIDEE: Novel Room Reorganization using Visuo-Semantic Common Sense Priors☆37Updated last year
- ☆43Updated last year
- ☆73Updated 5 months ago
- Code for the paper Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration☆93Updated 2 years ago
- ☆42Updated 9 months ago
- Official codebase for EmbCLIP☆117Updated last year
- MiniGrid Implementation of BEHAVIOR Tasks☆36Updated 5 months ago
- ☆60Updated 2 years ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆38Updated last year
- General-purpose Visual Understanding Evaluation☆20Updated last year
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆24Updated 9 months ago
- ☆43Updated 9 months ago
- ☆59Updated 3 months ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆32Updated last year
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…☆89Updated last year
- ☆42Updated last month
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆47Updated this week
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆59Updated last year
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆43Updated 9 months ago
- Instruction Following Agents with Multimodal Transforemrs☆52Updated 2 years ago
- ☆65Updated 3 months ago
- ☆59Updated 4 months ago
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation☆19Updated last year
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆32Updated 4 months ago
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models☆104Updated last year