zcczhang / UVD
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
☆44Updated last month
Related projects: ⓘ
- Codebase for the 'BestMan' Mobile Manipulator☆89Updated last week
- ☆327Updated 4 months ago
- ☆39Updated 8 months ago
- The code repo contains multiple code reproduction processes of various SOTA deep learning algorithms☆43Updated 2 years ago
- ☆41Updated 3 weeks ago
- Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"☆94Updated last year
- [ICCV 2023 Oral] Pytorch Implementation☆97Updated 10 months ago
- Repo for DragTraffic: Interactive and Controllable Traffic Scene Generation for Autonomous Driving.☆11Updated 2 weeks ago
- Chain-of-Thought Predictive Control☆54Updated last year
- ☆83Updated last year
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"☆79Updated last year
- Official Implementation of RoboCLIP (NeurIPS 2023)☆33Updated last month
- Official Code Repo for GENIMA☆45Updated last week
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆34Updated 10 months ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆66Updated 2 months ago
- ☆23Updated 4 months ago
- Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)☆79Updated 11 months ago
- InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning (RSS 2024)☆25Updated 3 months ago
- [RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…☆51Updated 2 months ago
- ☆44Updated 7 months ago
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆56Updated last month
- ☆24Updated last week
- Code for BAKU: An Efficient Transformer for Multi-Task Policy Learning☆66Updated 2 months ago
- PyTorch implementation of the Hiveformer research paper☆46Updated last year
- WorldGPT: Empowering LLM as Multimodal World Model☆116Updated last month
- (ICLR 2024) Reverse Forward Curriculum Learning☆36Updated 2 weeks ago
- ☆70Updated last year
- A simple testbed for robotics manipulation policies based on robomimic☆14Updated last week
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me…☆19Updated last year
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models☆81Updated 5 months ago