joyhsu0504 / LEFT
☆42 · Updated 11 months ago
Alternatives and similar repositories for LEFT:
Users who are interested in LEFT are comparing it to the repositories listed below.
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆58 · Updated 6 months ago
- ☆13 · Updated 9 months ago
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration". ☆44 · Updated 3 months ago
- Code for Stable Control Representations ☆24 · Updated 2 months ago
- ☆21 · Updated last year
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos ☆60 · Updated last year
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023) ☆36 · Updated last year
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction ☆27 · Updated 3 months ago
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization ☆50 · Updated last year
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image ☆42 · Updated last year
- ☆17 · Updated 9 months ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024) ☆44 · Updated 9 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning" ☆47 · Updated 3 months ago
- ☆43 · Updated last year
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models ☆83 · Updated last year
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆42 · Updated last year
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation" ☆90 · Updated 2 years ago
- MiniGrid Implementation of BEHAVIOR Tasks ☆42 · Updated 7 months ago
- ☆46 · Updated 3 months ago
- ☆56 · Updated last week
- ☆67 · Updated 6 months ago
- [ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning ☆27 · Updated 5 months ago
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation ☆19 · Updated last year
- [NeurIPS 2024] Official code repository for MSR3D paper ☆45 · Updated 3 weeks ago
- General-purpose Visual Understanding Evaluation ☆20 · Updated last year
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning ☆127 · Updated last year
- SNARE Dataset with MATCH and LaGOR models ☆24 · Updated last year
- Personal Python toolbox ☆15 · Updated 8 months ago
- Evaluate Multimodal LLMs as Embodied Agents ☆38 · Updated last month
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024 ☆27 · Updated 2 months ago