CraftJarvis / JARVIS-1
JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models
☆361Updated 11 months ago
Alternatives and similar repositories for JARVIS-1:
Users that are interested in JARVIS-1 are comparing it to the libraries listed below
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆379Updated last year
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft☆184Updated 9 months ago
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆262Updated 8 months ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆270Updated last year
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆306Updated 5 months ago
- [ECCV2024] 🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.☆284Updated 10 months ago
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate☆415Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆463Updated last year
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"☆804Updated this week
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆736Updated 7 months ago
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"☆246Updated last week
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learning☆425Updated 2 months ago
- ☆397Updated last year
- Code for Quiet-STaR☆721Updated 7 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆291Updated 10 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆245Updated 5 months ago
- General multi-task deep RL Agent☆178Updated 9 months ago
- ☆81Updated last year
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆189Updated last week
- Reflexion: an autonomous agent with dynamic memory and self-reflection☆385Updated last year
- A codebase for "Language Models can Solve Computer Tasks"☆234Updated 10 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆222Updated 4 months ago
- Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memo…☆616Updated last year
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆313Updated 6 months ago
- Reasoning with Language Model is Planning with World Model☆161Updated last year
- An implemtation of Everyting of Thoughts (XoT).☆141Updated last year
- ☆141Updated 10 months ago
- ☆82Updated 9 months ago
- Foundation Model for MineDojo☆260Updated last year
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆731Updated last month