joyhsu0504 / LEFT
☆42 Updated 11 months ago
Alternatives and similar repositories for LEFT:
Users who are interested in LEFT are comparing it to the repositories listed below.
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆58 Updated 6 months ago
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration". ☆44 Updated 3 months ago
- ☆23 Updated last year
- ☆13 Updated 10 months ago
- Official Code for Neural Systematic Binder ☆32 Updated 2 years ago
- Code for Stable Control Representations ☆24 Updated last week
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction ☆28 Updated 3 months ago
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023) ☆36 Updated last year
- ☆17 Updated 9 months ago
- SNARE Dataset with MATCH and LaGOR models ☆24 Updated last year
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models ☆85 Updated last year
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024) ☆44 Updated 9 months ago
- 📎 + 🦾 CLIP-RT: Learning Language-Conditioned Robotic Policies from Natural Language Supervision ☆15 Updated 5 months ago
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions". ☆22 Updated 5 months ago
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image ☆42 Updated last year
- ☆46 Updated 4 months ago
- Personal Python toolbox ☆16 Updated 9 months ago
- ☆43 Updated last year
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization ☆50 Updated last year
- ☆67 Updated 7 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆42 Updated last year
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation ☆19 Updated last year
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation" ☆91 Updated 2 years ago
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos ☆61 Updated last year
- MiniGrid Implementation of BEHAVIOR Tasks ☆44 Updated 8 months ago
- Program synthesis for 3D spatial reasoning ☆25 Updated last month
- Python package for importing and loading external assets into AI2THOR ☆20 Updated 6 months ago
- Evaluate Multimodal LLMs as Embodied Agents ☆43 Updated 2 months ago
- General-purpose Visual Understanding Evaluation ☆20 Updated last year
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆53 Updated 4 months ago