JiuTian-VL/Optimus-2

JiuTian-VL / Optimus-2

[CVPR 2025] Official Implementation for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy

☆24

Alternatives and similar repositories for Optimus-2

Users who are interested in Optimus-2 are comparing it to the repositories listed below.

JiuTian-VL / Optimus-1
[NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
☆96 · Jun 17, 2025 · Updated 9 months ago
Hyu-Zhang / BiHGH
[ACMMM 2022 Oral] Official Implementation for Bi-directional Heterogeneous Graph Hashing towards Efficient Outfit Recommendation
☆11 · Dec 12, 2022 · Updated 3 years ago
SaDil13 / VLN-RAM
Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…
☆17 · Nov 11, 2025 · Updated 4 months ago
xiaojieli0903 / FGKVMemPred_video
Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)
☆23 · Jul 11, 2024 · Updated last year
StarMade / StarMade
Open source repository of StarMade Coders Pack that you can use to decompile and recompile StarMade to make mods!
☆17 · Aug 11, 2015 · Updated 10 years ago
OpenCausaLab / ADAM
We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, le…
☆27 · Apr 7, 2025 · Updated 11 months ago
lizaijing / SAEval-Benchmark
SAEval: A benchmark for sentiment analysis to evaluate the model's performance on various subtasks.
☆14 · Apr 29, 2024 · Updated last year
expectorlin / CONSOLE
Code of the paper "Correctable Landmark Discovery via Large Models for Vision-Language Navigation" (TPAMI 2024)
☆16 · Jun 7, 2024 · Updated last year
JiuTian-VL / LION-FS
[CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant
☆27 · Dec 2, 2025 · Updated 3 months ago
ylwhxht / MSGNav
CVPR 2026 - MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation
☆35 · Mar 17, 2026 · Updated last week
CraftJarvis / GROOT
GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)
☆67 · Dec 18, 2023 · Updated 2 years ago
ChenMnZ / SMMix
[ICCV 23] This is a PyTorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"
☆16 · Jul 14, 2023 · Updated 2 years ago
viemccoy / grimoire
Synthetic Hypertext and Homomorphic Catalogue
☆15 · Dec 28, 2024 · Updated last year
jeasinema / egl-docker
A customized docker for headless GPU rendering without host-side configuration
☆10 · Aug 22, 2022 · Updated 3 years ago
lucaspk512 / vrdone
Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection".
☆11 · Nov 13, 2024 · Updated last year
syscalldev / FutureGPT
⚡ FutureGPT - Application development framework that connects GPT-4 with external data, the internet, other applications and language mod…
☆12 · May 14, 2023 · Updated 2 years ago
IT3DEgo / IT3DEgo
CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"
☆19 · Jun 27, 2024 · Updated last year
tiiuae / FineLIP
code for FineLIP
☆40 · Nov 25, 2025 · Updated 4 months ago
PantheonInfer / Pantheon
Source code for the paper: "Pantheon: Preemptible Multi-DNN Inference on Mobile Edge GPUs"
☆16 · Apr 15, 2024 · Updated last year
Zhengbo-Zhang / FADE
Code of paper "A Video Dataset for Falling Object Detection around Buildings" https://arxiv.org/abs/2408.05750
☆18 · Jul 10, 2025 · Updated 8 months ago
ai4ce / INT-ACT
Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
☆32 · Nov 2, 2025 · Updated 4 months ago
ameesh-shah / liten-vla
LITEN: Learning from Inference Time Execution for VLAs
☆27 · Oct 23, 2025 · Updated 5 months ago
ml-postech / BEAG
☆11 · Jul 4, 2024 · Updated last year
AccomplishedCode / Deep-Reinforcement-Learning-Stock-Trader
☆13 · Apr 28, 2019 · Updated 6 years ago
ControlNet / HYDRA
[ECCV] HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
☆25 · Sep 6, 2025 · Updated 6 months ago
mlvlab / RALF
Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".
☆45 · Sep 12, 2024 · Updated last year
CUHKWilliam / GeoManip-release
☆13 · Apr 22, 2025 · Updated 11 months ago
Ken-Chy129 / student-course-choosing
Student course selection system
☆11 · Mar 1, 2023 · Updated 3 years ago
natterman12 / Golf-Ball-Tracking-and-Speed-Detection
A small project to track and calculate the speed from a putt.
☆20 · Oct 26, 2023 · Updated 2 years ago
CraftJarvis / OpenHA
Repo for Paper "OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft"
☆28 · Feb 5, 2026 · Updated last month
augcog / DTTD2
[CVPRW 2025] Official repository of DTTDNet: Robust Digital-Twin Localization via An RGBD-based Transformer Network and A Comprehensive E…
☆22 · Nov 17, 2025 · Updated 4 months ago
pcchenxi / LAPO-offlienRL
☆15 · Jan 18, 2026 · Updated 2 months ago
ChgygLin / TrackNetV2-pytorch
A PyTorch implementation of TrackNetV2 from TensorFlow (ncnn C++ inference)
☆54 · Nov 3, 2024 · Updated last year
ml-postech / multi-armed-bandit-algorithm-against-strategic-replication
Official implementation of "Multi-armed Bandit Algorithm against Strategic Replication"
☆14 · May 17, 2022 · Updated 3 years ago
sichun233746 / MoTIF
MoTIF: Learning Motion Trajectories with Local Implicit Neural Functions for Continuous Space-Time Video Super-Resolution
☆38 · Sep 30, 2023 · Updated 2 years ago
augcog / DTTDv1
[CVPRW 2023] Official repository of "Digital Twin Tracking Dataset (DTTD): A New RGB+Depth 3D Dataset for Longer-Range Object Tracking Ap…
☆24 · Nov 22, 2024 · Updated last year
yanghan-yh / MCA-Ctrl
CVPR2025-Multi-party Collaborative Attention Control for Image Customization
☆16 · May 14, 2025 · Updated 10 months ago
StoreBlank / KUDA
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation
☆22 · Apr 23, 2025 · Updated 11 months ago
CraftJarvis / MC-Controller
Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction"
☆46 · Aug 15, 2023 · Updated 2 years ago