Gary3410 / TaPALinks

[arXiv 2023] Embodied Task Planning with Large Language Models

☆192

Alternatives and similar repositories for TaPA

Users that are interested in TaPA are comparing it to the libraries listed below

Sorting:

OpenGVLab / Instruct2Act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
☆370Updated last year
embodied-agent-interface / embodied-agent-interface
Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)
☆266Updated 8 months ago
MichalZawalski / embodied-CoT
Embodied Chain of Thought: A robotic policy that reason to solve the task.
☆323Updated 7 months ago
clorislili / ManipLLM
The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)
☆143Updated last year
stevenyangyj / Emma-Alfworld
Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
☆59Updated last year
UMass-Embodied-AGI / MultiPLY
Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
☆134Updated last year
aiming-lab / GRAPE
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
☆151Updated 7 months ago
yueyang130 / DeeR-VLA
Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"
☆117Updated 9 months ago
NVlabs / progprompt-vh
ProgPrompt for Virtualhome
☆144Updated 2 years ago
RoboFlamingo / RoboFlamingo
Code for RoboFlamingo
☆409Updated last year
SiyuanHuang95 / ManipVQA
[IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
☆97Updated last year
OpenMOSS / VLABench
Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
☆332Updated last week
WayneMao / RoboMatrix
The Official Implementation of RoboMatrix
☆102Updated 6 months ago
Gabesarch / HELPER
☆32Updated last year
declare-lab / Emma-X
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning
☆76Updated 6 months ago
EmbodiedBench / EmbodiedBench
[ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.
☆214Updated last month
lmzpai / roboMamba
The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`
☆139Updated 11 months ago
kyegomez / RT-X
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
☆229Updated last week
EmbodiedGPT / EgoCOT_Dataset
☆54Updated last year
sled-group / navchat
Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation" Paper//arxiv.org/abs/2310.07968 …
☆31Updated last year
thunlp / EmbodiedEval
Evaluate Multimodal LLMs as Embodied Agents
☆54Updated 9 months ago
Fanqi-Lin / OneTwoVLA
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆198Updated 5 months ago
H-Freax / Awesome-Video-Robotic-Papers
This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and fol…
☆168Updated 9 months ago
InternRobotics / InternVLA-M1
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆269Updated last week
allenai / spoc-robot-training
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
☆139Updated last year
kyegomez / PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
☆330Updated last year
OpenHelix-Team / OpenHelix
OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation
☆323Updated 2 months ago
vimalabs / VIMABench
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
☆320Updated 2 years ago
OSU-NLP-Group / LLM-Planner
[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
☆208Updated 7 months ago
bytedance / GR-1
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆286Updated last year