CraftJarvis/ROCKET-1

CraftJarvis / ROCKET-1

Official implementation of paper "ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting" (CVPR'25)

☆46

Alternatives and similar repositories for ROCKET-1

Users who are interested in ROCKET-1 are comparing it to the repositories listed below.

CraftJarvis / GROOT
GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)
☆67 · Dec 18, 2023 · Updated 2 years ago
CraftJarvis / OpenHA
Repo for Paper "OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft"
☆24 · Feb 5, 2026 · Updated last month
linhaowei1 / kumo
☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models
☆19 · Jun 4, 2025 · Updated 9 months ago
JiuTian-VL / Optimus-1
[NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
☆95 · Jun 17, 2025 · Updated 8 months ago
CraftJarvis / MCU
☆37 · Oct 21, 2025 · Updated 4 months ago
WellyZhang / ALANS
Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning
☆11 · Jul 20, 2022 · Updated 3 years ago
trueagi-io / minecraft-demo
☆13 · Dec 5, 2025 · Updated 3 months ago
tsinghua-fib-lab / TrajAgent
[NeurIPS 2025] TrajAgent: An LLM-Agent Framework for Trajectory Modeling via Large-and-Small Model Collaboration
☆21 · Nov 30, 2025 · Updated 3 months ago
CraftJarvis / MineStudio
MineStudio: A Streamlined Package for Minecraft AI Agent Development
☆348 · Feb 7, 2026 · Updated 3 weeks ago
elated-sawyer / WALL-E
Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents
☆57 · Dec 3, 2025 · Updated 3 months ago
FreedomIntelligence / LongLLaVA
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture
☆213 · Jan 6, 2025 · Updated last year
PKU-RL / Creative-Agents
☆47 · Dec 11, 2023 · Updated 2 years ago
CraftJarvis / JarvisVLA
Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse"
☆134 · Aug 27, 2025 · Updated 6 months ago
Zhoues / MineDreamer
[IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…
☆104 · Jun 16, 2025 · Updated 8 months ago
CraftJarvis / OmniJARVIS
☆30 · Jun 25, 2024 · Updated last year
Moe-Team / MysteryOnline
Actually released!
☆10 · Feb 24, 2021 · Updated 5 years ago
CraftJarvis / JARVIS-1
JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models
☆389 · Apr 8, 2024 · Updated last year
starrYYxuan / LeCo
This is the implementation of LeCo
☆31 · Jan 20, 2025 · Updated last year
HeliosZhao / GenXD
GenXD: Generating Any 3D and 4D Scenes. ICLR 2025
☆221 · Mar 30, 2025 · Updated 11 months ago
belindal / LaMPP
Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action
☆37 · Apr 3, 2023 · Updated 2 years ago
PawCorp / walltaker-desktop-client
Desktop client for Walltaker powered by golang
☆12 · Sep 13, 2022 · Updated 3 years ago
nicolaus-huang / ProcessPainter
[SIGGRAPH Asia 2024] Painting process generating using diffusion models
☆94 · Nov 12, 2025 · Updated 3 months ago
Buzz-Beater / EgoTaskQA
Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.
☆37 · Apr 17, 2023 · Updated 2 years ago
minerllabs / basalt-2022-behavioural-cloning-baseline
Simple behavioural cloning baseline solution for BASALT 2022
☆32 · Nov 3, 2022 · Updated 3 years ago
xverse-ai / XVERSE-MoE-A36B
XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.
☆39 · Sep 12, 2024 · Updated last year
anniedoris / design_qa
☆57 · Aug 10, 2025 · Updated 6 months ago
KegangWangCCNU / PhysBench
A large-scale training and benchmarking framework for rPPG.
☆10 · Nov 26, 2024 · Updated last year
interactivefiction / vimform7
Vim environment for authoring, compiling, and debugging Inform7 based interactive fiction works.
☆11 · Aug 22, 2020 · Updated 5 years ago
asappresearch / webagents-step
☆41 · Jul 21, 2024 · Updated last year
OHDSI / Koios
Tool to identify concept in the OMOP Genomic vocabulary from VCF and other files as well as HGVS notations
☆15 · May 6, 2024 · Updated last year
gkswamy98 / causal_il
Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…
☆11 · Dec 9, 2022 · Updated 3 years ago
p-ortmann / dyntapy
☆10 · Mar 8, 2024 · Updated last year
Open-Hexagon / oh-ce
A rewrite of Open Hexagon
☆12 · Feb 21, 2026 · Updated last week
DeckardAgent / deckard
Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"
☆94 · May 23, 2023 · Updated 2 years ago
yuPeiyu98 / Deep-Region-Competition
[NeurIPS 2021] Unsupervised Foreground Extraction via Deep Region Competition
☆43 · Updated this week
rese1f / STEVE
[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment
☆41 · Dec 27, 2023 · Updated 2 years ago
yu-shaonian / AnimateAnything
CVPR 2025: AnimateAnything
☆186 · Jun 3, 2025 · Updated 9 months ago
ktvng / pebble
Natural language programming language
☆11 · Aug 28, 2020 · Updated 5 years ago
ml-postech / BEAG
☆11 · Jul 4, 2024 · Updated last year