[CVPR 2025] Official Implementation for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy
☆25Jun 17, 2025Updated 11 months ago
Alternatives and similar repositories for CVPR25-Optimus-2
Users that are interested in CVPR25-Optimus-2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆101Jun 17, 2025Updated 11 months ago
- [ACMMM 2022 Oral] Official Implementation for Bi-directional Heterogeneous Graph Hashing towards Efficient Outfit Recommendation☆11Apr 8, 2026Updated 2 months ago
- Paper List of Minecraft Agents☆69May 24, 2026Updated 2 weeks ago
- [CVPR 2025] Plug-and-Play Versatile Compressed Video Enhancement☆22Jan 19, 2026Updated 4 months ago
- Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)☆23Jul 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, le…☆31Apr 7, 2025Updated last year
- [CVPR 2022 Oral] Faithful Extreme Rescaling via Generative Prior Reciprocated Invertible Representations☆13Jul 14, 2022Updated 3 years ago
- ☆13May 13, 2025Updated last year
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)☆67Dec 18, 2023Updated 2 years ago
- [CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant☆30Dec 2, 2025Updated 6 months ago
- Detection and Reconstruction of Transparent Objects with Infrared Projection-based RGB-D Cameras☆13Jan 17, 2021Updated 5 years ago
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆103Jun 16, 2025Updated 11 months ago
- Synthetic Hypertext and Homomorphic Catalogue☆15Dec 28, 2024Updated last year
- Code for the paper "Spatial-Temporal Multi-Cuts for Online Multiple-Camera Vehicle Tracking"☆15Apr 12, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A customized docker for headless GPU rendering without host-side configuration☆11Aug 22, 2022Updated 3 years ago
- CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"☆19Jun 27, 2024Updated last year
- Official repository of " SFTrack: A Robust Scale and Motion Adaptive Algorithm for Tracking Small and Fast Moving Objects" (IROS 2024)☆18Mar 9, 2025Updated last year
- Contrastive multi-omics association learning☆13Apr 28, 2026Updated last month
- 💬 Send iMessages using Python through the Shortcuts app.☆18May 25, 2024Updated 2 years ago
- detecting tennis court keypoints with yolo☆10Apr 19, 2026Updated last month
- Multimodal datasets.☆34Jan 26, 2024Updated 2 years ago
- ☆10May 5, 2024Updated 2 years ago
- Source code for the paper: "Pantheon: Preemptible Multi-DNN Inference on Mobile Edge GPUs"☆16Apr 15, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection".☆12Nov 13, 2024Updated last year
- Code of paper "A Video Dataset for Falling Object Detection around Buildings" https://arxiv.org/abs/2408.05750☆19Jul 10, 2025Updated 10 months ago
- [KDD Explore'24]Time Series Forecasting with LLMs: Understanding and Enhancing Model Capabilities☆17May 7, 2025Updated last year
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆34Nov 2, 2025Updated 7 months ago
- [ECCV] HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning☆26Sep 6, 2025Updated 9 months ago
- ☆13Apr 28, 2019Updated 7 years ago
- A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling☆15Dec 5, 2023Updated 2 years ago
- A small project to track and calculate the speed from a putt.☆20Oct 26, 2023Updated 2 years ago
- ☆12Apr 22, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Repo for Paper "OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft"☆34May 10, 2026Updated 3 weeks ago
- HEtero-Assists Distillation for Heterogeneous Object Detectors☆10Jul 3, 2023Updated 2 years ago
- ☆16Apr 14, 2026Updated last month
- [CVPRW 2025] Official repository of DTTDNet: Robust Digital-Twin Localization via An RGBD-based Transformer Network and A Comprehensive E…☆24Apr 9, 2026Updated 2 months ago
- ☆20Apr 14, 2023Updated 3 years ago
- Official implementation of "Multi-armed Bandit Algorithm against Strategic Replication"☆14May 17, 2022Updated 4 years ago
- [CVPRW 2023] Official repository of "Digital Twin Tracking Dataset (DTTD): A New RGB+Depth 3D Dataset for Longer-Range Object Tracking Ap…☆24Nov 22, 2024Updated last year