iLearn-Lab/CVPR25-Optimus-2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/iLearn-Lab/CVPR25-Optimus-2)

iLearn-Lab / CVPR25-Optimus-2

[CVPR 2025] Official Implementation for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy

☆27

Alternatives and similar repositories for CVPR25-Optimus-2

Users that are interested in CVPR25-Optimus-2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MineAnyBuild / MineAnyBuild
View on GitHub
Code and benchmark of the paper "MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents" (NeurIPS D&B 2025)
☆15Oct 13, 2025Updated 9 months ago
iLearn-Lab / CVPR26-OptimusVLA
View on GitHub
[CVPR 2026] Official Implementation for Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Effi…
☆25Updated this week
iLearn-Lab / MM2023-FGKVMemPred_video
View on GitHub
Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)
☆23Jul 11, 2024Updated 2 years ago
cszzx / GRAIN
View on GitHub
[CVPR 2022 Oral] Faithful Extreme Rescaling via Generative Prior Reciprocated Invertible Representations
☆13Jul 14, 2022Updated 4 years ago
aubokani / Bandwidth-Dataset
View on GitHub
Mobile network bandwidth traces
☆14Oct 13, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
expectorlin / CONSOLE
View on GitHub
Code of the paper "Correctable Landmark Discovery via Large Models for Vision-Language Navigation" (TPAMI 2024)
☆16Jun 7, 2024Updated 2 years ago
TAMS-Group / tams_glass_reconstruction
View on GitHub
Detection and Reconstruction of Transparent Objects with Infrared Projection-based RGB-D Cameras
☆13Jan 17, 2021Updated 5 years ago
iLearn-Lab / CVPR25-LION-FS
View on GitHub
[CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant
☆29Dec 2, 2025Updated 7 months ago
gjhhust / XS-VID
View on GitHub
XS-VID: An Extra Small Object Video Detection Dataset
☆10Mar 4, 2025Updated last year
sisuolv / 2021--CCF-Big-Data-Computing-Intelligence-Contest--Script-character-emotion-recognition--5th
View on GitHub
https://www.datafountain.cn/competitions/518
☆13Mar 1, 2023Updated 3 years ago
IBM / comical
View on GitHub
Contrastive multi-omics association learning
☆13Apr 28, 2026Updated 2 months ago
mmmmmm44 / tennis_court_detection
View on GitHub
Python Implementation of paper "Robust Camera Calibration for Sport Videos using Court Models"
☆14Nov 15, 2023Updated 2 years ago
jeasinema / egl-docker
View on GitHub
A customized docker for headless GPU rendering without host-side configuration
☆11Aug 22, 2022Updated 3 years ago
sisuolv / CVPR--Sorghum--100-Cultivar-Identification--FGVC-9--3rd
View on GitHub
https://www.kaggle.com/competitions/sorghum-id-fgvc-9
☆19Mar 1, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
PantheonInfer / Pantheon
View on GitHub
Source code for the paper: "Pantheon: Preemptible Multi-DNN Inference on Mobile Edge GPUs"
☆16Apr 15, 2024Updated 2 years ago
CUHKWilliam / GeoManip-release
View on GitHub
☆12Apr 22, 2025Updated last year
donglaiw / AoT_Dataset
View on GitHub
CVPR18: Learning and Using the Arrow of Time
☆40Feb 11, 2022Updated 4 years ago
LutingWang / HEAD
View on GitHub
HEtero-Assists Distillation for Heterogeneous Object Detectors
☆10Jul 3, 2023Updated 3 years ago
sichun233746 / MoTIF
View on GitHub
MoTIF: Learning Motion Trajectories with Local Implicit Neural Functions for Continuous Space-Time Video Super-Resolution
☆38Sep 30, 2023Updated 2 years ago
chenyuntc / keypoint
View on GitHub
Implemention of "Realtime Multi Person Pose-Estimation" in pytorch with data from AI Challenger
☆13Nov 24, 2017Updated 8 years ago
roudimit / c2kd
View on GitHub
Code for the C2KD paper (ICASSP 2023)
☆20May 15, 2023Updated 3 years ago
XiaokunFeng / CTVLT
View on GitHub
[ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues
☆19Dec 31, 2024Updated last year
RWLinno / sojump-helper
View on GitHub
这是一个问卷星互填社区刷点数的工具，进而帮您更快采集自己的样本
☆11Oct 31, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
CraftJarvis / MC-Controller
View on GitHub
Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction"
☆47Aug 15, 2023Updated 2 years ago
iLearn-Lab / AAAI26-SemanticVLA
View on GitHub
[AAAI 2026 Oral] SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation
☆70Apr 5, 2026Updated 3 months ago
alexanderswerdlow / faster
View on GitHub
☆29Jun 30, 2026Updated 2 weeks ago
NipunaRanasinghe / awesome-ai-agents
View on GitHub
A curated list of frameworks, tools, and resources for building and deploying AI agents. From multi-agent systems to autonomous coding as…
☆35Jul 13, 2026Updated last week
RWLinno / ViT-Model-based-Medical-Image-Assisted-Diagnostic-System
View on GitHub
基于ViT模型的医疗图像辅助诊断系统
☆11Jan 30, 2024Updated 2 years ago
Minyoung1005 / motif
View on GitHub
☆22Apr 17, 2026Updated 3 months ago
mayhugotong / VideoINSTA
View on GitHub
This is the official impletations of the EMNLP Findings paper, VideoINSTA: Zero-shot Long-Form Video Understanding via Informative Spatia…
☆24Apr 7, 2026Updated 3 months ago
weekgoodday / LagMemo
View on GitHub
LagMemo: Language 3D Gaussian Splatting Memory for Multi-modal Open-vocabulary Multi-goal Visual Navigation
☆18Jun 17, 2026Updated last month
Aiden0526 / MuSLR
View on GitHub
Coda and Data for NeurIPS 2025 paper "MuSLR: Multimodal Symbolic Logical Reasoning"
☆16Oct 5, 2025Updated 9 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
yoshall / SINPA
View on GitHub
☆15Aug 5, 2025Updated 11 months ago
Event-AHU / Cross_Resolution_SOT
View on GitHub
[IEEE TMM 2025] CRSOT: Cross-Resolution Object Tracking using Unaligned Frame and Event Cameras
☆22Jan 18, 2025Updated last year
cgjacklin / transmdot
View on GitHub
TransMDOT
☆22Jan 8, 2024Updated 2 years ago
jzsherlock4869 / lowlevel-book-codebase
View on GitHub
Source code for book "Image algorithms for low-level vision tasks" (Jia. 2024), including denoising, super-resolution, dehazing, image co…
☆20Jul 19, 2025Updated last year
FireRedTeam / IVC-Prune
View on GitHub
IVC-Prune: Revealing the Implicit Visual Coordinates in LVLMs for Vision Token Pruning
☆16Feb 27, 2026Updated 4 months ago
shvdiwnkozbw / Self-supervised-Video-Concept
View on GitHub
Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.
☆11Jul 28, 2022Updated 3 years ago
yifan12wu / rl-laplacian
View on GitHub
Learning Laplacian Representations in Reinforcement Learning
☆18Jan 2, 2021Updated 5 years ago