CraftJarvis / JARVIS-1
JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models
☆339Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for JARVIS-1
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆252Updated 4 months ago
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆364Updated last year
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆279Updated 3 weeks ago
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft☆170Updated 5 months ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆255Updated last year
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆682Updated 3 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆448Updated 8 months ago
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"☆225Updated 3 weeks ago
- 🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.☆264Updated 6 months ago
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate☆353Updated last year
- ☆373Updated last year
- A codebase for "Language Models can Solve Computer Tasks"☆225Updated 6 months ago
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learning☆366Updated this week
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆753Updated last month
- VisualWebArena is a benchmark for multimodal agents.☆244Updated last week
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆276Updated 2 months ago
- An implemtation of Everyting of Thoughts (XoT).☆132Updated 9 months ago
- Official Repo for UGround☆97Updated last week
- Code for Quiet-STaR☆651Updated 3 months ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆259Updated last month
- General multi-task deep RL Agent☆165Updated 5 months ago
- Humanoid Agents: Platform for Simulating Human-like Generative Agents☆263Updated last month
- ☆75Updated 5 months ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆647Updated last week
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"☆716Updated 3 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆220Updated last month
- ☆137Updated 6 months ago
- ☆116Updated 5 months ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆536Updated last year
- CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆191Updated last week