JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models
☆399Apr 8, 2024Updated 2 years ago
Alternatives and similar repositories for JARVIS-1
Users who are interested in JARVIS-1 are comparing it to the libraries listed below.
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)☆69Dec 18, 2023Updated 2 years ago
- Text world based on Minecraft rules.☆17May 13, 2024Updated 2 years ago
- Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction"☆46Aug 15, 2023Updated 2 years ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen…☆293Aug 3, 2023Updated 2 years ago
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft☆211Jun 4, 2024Updated 2 years ago
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆103Jun 16, 2025Updated last year
- ☆30Jun 25, 2024Updated 2 years ago
- [CVPR2024] This is the official implementation of MP5☆105Jun 30, 2024Updated 2 years ago
- Official Implementation of Paper "ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment" (AAAI'26)☆42Jul 2, 2025Updated last year
- Paper List of Minecraft Agents☆69May 24, 2026Updated last month
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆20Jun 4, 2025Updated last year
- Official implementation of paper "ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting" (CVPR'25)☆46Apr 13, 2025Updated last year
- MineStudio: A Streamlined Package for Minecraft AI Agent Development☆387May 12, 2026Updated last month
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆35Feb 10, 2024Updated 2 years ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆101Jun 17, 2025Updated last year
- An Open-Ended Embodied Agent with Large Language Models☆7,015Apr 3, 2024Updated 2 years ago
- ☆53Oct 21, 2025Updated 8 months ago
- Foundation Model for MineDojo☆300Apr 2, 2023Updated 3 years ago
- ☆49Dec 11, 2023Updated 2 years ago
- Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse"☆161Aug 27, 2025Updated 10 months ago
- We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, le…☆31Apr 7, 2025Updated last year
- ☆89Dec 15, 2023Updated 2 years ago
- The official implementation of the paper "Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork".☆12Feb 27, 2024Updated 2 years ago
- ☆102Jun 12, 2024Updated 2 years ago
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills☆768Feb 1, 2024Updated 2 years ago
- Building Open-Ended Embodied Agents with Internet-Scale Knowledge☆2,226Mar 18, 2024Updated 2 years ago
- Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memo…☆641Jun 5, 2023Updated 3 years ago
- Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos☆1,714Sep 3, 2025Updated 10 months ago
- Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs☆112Sep 30, 2025Updated 9 months ago
- Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning☆11Jul 20, 2022Updated 3 years ago
- GPT-4V in Wonderland: LMMs as Smartphone Agents☆134Jul 17, 2024Updated last year
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆94May 23, 2023Updated 3 years ago
- [ECCV2024] 🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.☆301May 20, 2024Updated 2 years ago
- A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,776Sep 9, 2024Updated last year
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆268Jun 28, 2024Updated 2 years ago
- Repo for Paper "OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft"☆37Jun 5, 2026Updated last month
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆417Jan 7, 2026Updated 5 months ago
- BASALT Benchmark datasets, evaluation code and agent training example.☆22Nov 29, 2023Updated 2 years ago
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆41Dec 27, 2023Updated 2 years ago