thunlp / LEGENT
Open Platform for Embodied Agents
β269Updated last month
Related projects β
Alternatives and complementary repositories for LEGENT
- πOctopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.β264Updated 6 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learningβ204Updated last month
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"β225Updated 3 weeks ago
- Towards Large Multimodal Models as Visual Foundation Agentsβ120Updated this week
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi eβ¦β352Updated 2 months ago
- GRUtopia: Dream General Robots in a City at Scaleβ513Updated 2 months ago
- Align Anything: Training All-modality Model with Feedbackβ245Updated last week
- β348Updated last month
- π» A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.β200Updated this week
- [CVPR2024] This is the official implement of MP5β84Updated 4 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bettβ¦β191Updated this week
- β114Updated 4 months ago
- β89Updated 3 months ago
- β40Updated 11 months ago
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Modelβ334Updated 4 months ago
- β101Updated 2 weeks ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.β259Updated last month
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chainβ99Updated 8 months ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.β332Updated 6 months ago
- (ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Lifeβ315Updated 4 months ago
- A curated list of awesome papers on Embodied AI and related research/industry-driven resources.β289Updated 3 months ago
- The model, data and code for the visual GUI Agent SeeClickβ226Updated 2 months ago
- Code for RoboFlamingoβ311Updated 6 months ago
- OpenEQA Embodied Question Answering in the Era of Foundation Modelsβ236Updated 2 months ago
- A generative and self-guided robotic agent that endlessly propose and master new skills.β597Updated 5 months ago
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learningβ366Updated this week
- [ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"β129Updated 3 weeks ago
- RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthinessβ241Updated 2 weeks ago
- RDT-1B: a Diffusion Foundation Model for Bimanual Manipulationβ472Updated last week
- Paper collections of the continuous effort start from World Models.β140Updated 4 months ago