CraftJarvis / JARVIS-1
JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models
☆330 · Updated 5 months ago
Related projects:
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking ☆249 · Updated 2 months ago
- Code for "Learning to Model the World with Language." ICML 2024 Oral. ☆349 · Updated 11 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks ☆239 · Updated this week
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft ☆168 · Updated 3 months ago
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models" ☆623 · Updated last month
- Code for Quiet-STaR ☆478 · Updated last month
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆439 · Updated 6 months ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen… ☆248 · Updated last year
- General multi-task deep RL Agent ☆158 · Updated 3 months ago
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate ☆332 · Updated 11 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents" ☆681 · Updated last month
- An implementation of Everything of Thoughts (XoT). ☆114 · Updated 6 months ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning. ☆200 · Updated last month
- A codebase for "Language Models can Solve Computer Tasks" ☆218 · Updated 4 months ago
- Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents. ☆154 · Updated this week
- 🐙 Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming. ☆249 · Updated 4 months ago
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration" ☆303 · Updated 4 months ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents ☆530 · Updated 10 months ago
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e… ☆305 · Updated last week
- The Tree of Thoughts (ToT) framework for solving complex reasoning tasks using LLMs ☆274 · Updated 3 weeks ago
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learning ☆331 · Updated last week
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models" ☆209 · Updated 3 weeks ago
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav… ☆304 · Updated 11 months ago
- Embed arbitrary modalities (images, audio, documents, etc) into large language models. ☆170 · Updated 5 months ago
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan… ☆439 · Updated 3 months ago
- [NeurIPS 2022] 🛒 WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents ☆256 · Updated 2 weeks ago