thunlp / LEGENT
Open Platform for Embodied Agents
β297Updated 2 months ago
Alternatives and similar repositories for LEGENT:
Users that are interested in LEGENT are comparing it to the libraries listed below
- [ECCV2024] πOctopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.β284Updated 9 months ago
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"β245Updated this week
- Towards Large Multimodal Models as Visual Foundation Agentsβ192Updated last month
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learningβ315Updated 2 months ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)β176Updated last week
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi eβ¦β406Updated 2 months ago
- [arXiv 2023] Embodied Task Planning with Large Language Modelsβ170Updated last year
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Modelβ354Updated 8 months ago
- β125Updated 8 months ago
- [CVPR2024] This is the official implement of MP5β96Updated 8 months ago
- OpenEQA Embodied Question Answering in the Era of Foundation Modelsβ259Updated 5 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.β215Updated last month
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.β323Updated 2 weeks ago
- GRUtopia: Dream General Robots in a City at Scaleβ678Updated this week
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesisβ111Updated this week
- [ICML 2024] Official code repository for 3D embodied generalist agent LEOβ415Updated last month
- (ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Lifeβ338Updated 3 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.β162Updated last week
- β413Updated 5 months ago
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Modelsβ164Updated this week
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"β153Updated 2 months ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task.β168Updated this week
- [CVPR'25] RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthinessβ306Updated last week
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chainβ102Updated 11 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorldβ52Updated 5 months ago
- β320Updated 10 months ago
- The model, data and code for the visual GUI Agent SeeClickβ330Updated 3 months ago
- [NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agentsβ308Updated 10 months ago