thunlp / LEGENT
Open Platform for Embodied Agents
☆326 Updated 6 months ago
Alternatives and similar repositories for LEGENT
Users who are interested in LEGENT are comparing it to the repositories listed below:
- [ECCV2024] Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming. ☆290 Updated last year
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks ☆156 Updated 2 months ago
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models" ☆265 Updated 4 months ago
- [CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository. ☆286 Updated last month
- ☆45 Updated 4 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning ☆376 Updated 7 months ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral) ☆221 Updated 5 months ago
- Towards Large Multimodal Models as Visual Foundation Agents ☆225 Updated 3 months ago
- [CVPR2024] This is the official implementation of MP5 ☆103 Updated last year
- OpenEQA Embodied Question Answering in the Era of Foundation Models ☆306 Updated 10 months ago
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents. ☆163 Updated 3 weeks ago
- [arXiv 2023] Embodied Task Planning with Large Language Models ☆188 Updated last year
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO ☆451 Updated 3 months ago
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model ☆366 Updated last year
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models ☆196 Updated 4 months ago
- ☆44 Updated last year
- Latest Advances on Embodied Multimodal LLMs (or Vision-Language-Action Models). ☆117 Updated last year
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain ☆105 Updated last year
- Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io ☆244 Updated 2 months ago
- ☆131 Updated last year
- All about Robotics and AI Agents you need are here ☆31 Updated last year
- ☆109 Updated 4 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld ☆57 Updated 10 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. ☆269 Updated last month
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models". ☆290 Updated 2 months ago
- Code for RoboFlamingo ☆397 Updated last year
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning. ☆372 Updated 5 months ago
- A simulation platform for versatile Embodied AI research and developments. ☆932 Updated 2 weeks ago
- Virtual Community: An Open World for Humans, Robots, and Society ☆150 Updated 2 weeks ago
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met… ☆150 Updated 11 months ago