kyegomez / PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
☆285 Updated last year
Alternatives and similar repositories for PALM-E:
Users who are interested in PALM-E are comparing it to the libraries listed below:
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model ☆353 Updated 7 months ago
- Democratization of RT-2 "RT-2: New model translates vision and language into action" ☆415 Updated 6 months ago
- Code for RoboFlamingo ☆341 Updated 9 months ago
- Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts" ☆293 Updated last year
- Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models" ☆191 Updated this week
- [arXiv 2023] Embodied Task Planning with Large Language Models ☆168 Updated last year
- Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning. ☆294 Updated last month
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO ☆413 Updated last month
- ☆275 Updated 3 weeks ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model ☆422 Updated 3 months ago
- ☆219 Updated last month
- Paper list in the survey paper: Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis ☆400 Updated 3 weeks ago
- Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts" ☆792 Updated 10 months ago
- OpenEQA Embodied Question Answering in the Era of Foundation Models ☆256 Updated 5 months ago
- A flexible and efficient codebase for training visually-conditioned language models (VLMs) ☆577 Updated 7 months ago
- Embodied Chain of Thought: A robotic policy that reasons to solve the task. ☆143 Updated 5 months ago
- Generating Robotic Simulation Tasks via Large Language Models ☆311 Updated 10 months ago
- Implementation of RT1 (Robotic Transformer) in Pytorch ☆399 Updated 4 months ago
- Voltron: Language-Driven Representation Learning for Robotics ☆217 Updated last year
- VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models ☆637 Updated this week
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo… ☆493 Updated 3 months ago
- 🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming. ☆283 Updated 9 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation" ☆220 Updated 10 months ago
- Official Code for RVT-2 and RVT ☆310 Updated last week
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral) ☆171 Updated last month
- ☆176 Updated 4 months ago
- 🔥[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy ☆187 Updated 2 weeks ago
- The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI). ☆473 Updated 9 months ago
- OmniGibson: a platform for accelerating Embodied AI research built upon NVIDIA's Omniverse engine. Join our Discord for support: https://… ☆614 Updated this week
- Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models ☆177 Updated last year