haosulab / cvpr-tutorial-2022
☆42 · Updated 2 years ago
Related projects:
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation" ☆79 · Updated last year
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆34 · Updated 10 months ago
- ☆58 · Updated 11 months ago
- [CoRL 2023 Oral] GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields ☆111 · Updated 8 months ago
- [ICCV 2023] Official code repository for ARNOLD benchmark ☆134 · Updated 5 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation" ☆43 · Updated 5 months ago
- ☆41 · Updated 3 weeks ago
- Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023) ☆79 · Updated 11 months ago
- 🔀 Visual Room Rearrangement ☆104 · Updated last year
- Code for the paper "Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation" ☆56 · Updated last month
- ☆63 · Updated last month
- ☆20 · Updated 3 months ago
- Code to evaluate a solution in the BEHAVIOR benchmark: starter code, baselines, submodules to iGibson and BDDL repos ☆52 · Updated 5 months ago
- [ECCV 2024] 🎉 Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipu… ☆34 · Updated 8 months ago
- This repository is a collection of research papers on World Models. ☆28 · Updated last year
- ☆11 · Updated last year
- Teaching robots to respond to open-vocab queries with CLIP and NeRF-like neural fields ☆153 · Updated 6 months ago
- PyTorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation" ☆66 · Updated 2 months ago
- [ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models ☆51 · Updated 7 months ago
- Code for "Learning Affordance Landscapes for Interaction Exploration in 3D Environments" (NeurIPS 20) ☆34 · Updated last year
- [RSS 2024] Learning Manipulation by Predicting Interaction ☆78 · Updated last month
- Official PyTorch implementation of Doduo: Dense Visual Correspondence from Unsupervised Semantic-Aware Flow ☆42 · Updated 7 months ago
- A unified architecture for multimodal multi-task robotic policy learning. ☆108 · Updated 7 months ago
- ☆11 · Updated 3 months ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model` ☆63 · Updated this week
- Official repo for "iVideoGPT: Interactive VideoGPTs are Scalable World Models", https://arxiv.org/abs/2405.15223 ☆60 · Updated 2 weeks ago
- Official implementation of the NRNS paper ☆33 · Updated 2 years ago
- [CoRL2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation ☆29 · Updated 3 months ago
- ☆29 · Updated this week
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023) ☆24 · Updated 8 months ago