liruiw / HPT
Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.
☆374Updated last month
Related projects ⓘ
Alternatives and complementary repositories for HPT
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆159Updated 3 weeks ago
- RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation☆432Updated this week
- [CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI☆489Updated 2 months ago
- A curated list of awesome papers on Embodied AI and related research/industry-driven resources.☆284Updated 3 months ago
- Code for RoboFlamingo☆309Updated 6 months ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆341Updated 2 weeks ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆179Updated 6 months ago
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model☆333Updated 4 months ago
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO☆363Updated last month
- A Survey of Embodied Learning for Object-Centric Robotic Manipulation☆131Updated last month
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆313Updated last week
- Theia: Distilling Diverse Vision Foundation Models for Robot Learning☆160Updated last month
- Official codebase for "Any-point Trajectory Modeling for Policy Learning"☆174Updated 2 months ago
- A comprehensive list of papers about Robot Manipulation, including papers, codes, and related websites.☆152Updated this week
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations☆486Updated this week
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆58Updated 5 months ago
- ☆286Updated 6 months ago
- ☆285Updated 6 months ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)☆84Updated 4 months ago
- This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and fol…☆122Updated 3 months ago
- GRUtopia: Dream General Robots in a City at Scale☆504Updated 2 months ago
- Implementation of "PaLM-E: An Embodied Multimodal Language Model"☆271Updated 9 months ago
- VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models☆576Updated 6 months ago
- Paper list in the survey paper: Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis☆360Updated last month
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆223Updated 2 months ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆170Updated 6 months ago
- Benchmarking Knowledge Transfer in Lifelong Robot Learning☆238Updated 2 months ago
- A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vi…☆543Updated last week
- ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation☆510Updated 2 months ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo, and OpenVLA) in simulation under common setu…☆39Updated 2 months ago