[CVPR 2025] Official Implementation for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy
☆25Jun 17, 2025Updated last year
Alternatives and similar repositories for CVPR25-Optimus-2
Users that are interested in CVPR25-Optimus-2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆20Nov 11, 2025Updated 7 months ago
- A Soul-grounded Minecraft social simulation runtime where Mineflayer actors pursue LifeGoals through evidence-backed action skills and tr…☆24Jun 18, 2026Updated last week
- Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)☆23Jul 11, 2024Updated last year
- Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)☆27Jul 11, 2024Updated last year
- Deep Double Incomplete Multi-view Multi-label Learning with Incomplete Labels and Missing Views☆14Apr 7, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR 2022 Oral] Faithful Extreme Rescaling via Generative Prior Reciprocated Invertible Representations☆13Jul 14, 2022Updated 3 years ago
- ☆17Sep 23, 2023Updated 2 years ago
- Code of the paper "Correctable Landmark Discovery via Large Models for Vision-Language Navigation" (TPAMI 2024)☆16Jun 7, 2024Updated 2 years ago
- Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"☆24Oct 8, 2025Updated 8 months ago
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)☆68Dec 18, 2023Updated 2 years ago
- [CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant☆29Dec 2, 2025Updated 6 months ago
- [ICCV 23]This is a Pytorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"☆16Jul 14, 2023Updated 2 years ago
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆103Jun 16, 2025Updated last year
- Synthetic Hypertext and Homomorphic Catalogue☆15Dec 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- XS-VID: An Extra Small Object Video Detection Dataset☆10Mar 4, 2025Updated last year
- [ICCV 2025 Highlight] Less is More: Empowering GUI Agent with Context-Aware Simplification☆49Mar 12, 2026Updated 3 months ago
- ☆34Oct 19, 2025Updated 8 months ago
- PyTorch implementation of SegBlocks: Towards Block-Based Adaptive Resolution Networks for Fast Segmentation (ECCV2020 Embedded Vision Wor…☆19Mar 31, 2023Updated 3 years ago
- CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"☆19Jun 27, 2024Updated 2 years ago
- Official repository of " SFTrack: A Robust Scale and Motion Adaptive Algorithm for Tracking Small and Fast Moving Objects" (IROS 2024)☆18Mar 9, 2025Updated last year
- [ICME 2025] Official Implementation for "VADMamba: Exploring State Space Models for Fast Video Anomaly Detection"☆16Dec 21, 2025Updated 6 months ago
- Contrastive multi-omics association learning☆13Apr 28, 2026Updated 2 months ago
- detecting tennis court keypoints with yolo☆10Apr 19, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10May 5, 2024Updated 2 years ago
- Source code for the paper: "Pantheon: Preemptible Multi-DNN Inference on Mobile Edge GPUs"☆16Apr 15, 2024Updated 2 years ago
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection".☆12Nov 13, 2024Updated last year
- Code of paper "A Video Dataset for Falling Object Detection around Buildings" https://arxiv.org/abs/2408.05750☆20Jul 10, 2025Updated 11 months ago
- LITEN: Learning from Inference Time Execution for VLAs☆27Oct 23, 2025Updated 8 months ago
- ☆11Jul 4, 2024Updated last year
- A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling☆15Dec 5, 2023Updated 2 years ago
- A small project to track and calculate the speed from a putt.☆21Oct 26, 2023Updated 2 years ago
- ☆12Apr 22, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPRW 2025] Official repository of DTTDNet: Robust Digital-Twin Localization via An RGBD-based Transformer Network and A Comprehensive E…☆25Apr 9, 2026Updated 2 months ago
- ☆16Apr 14, 2026Updated 2 months ago
- Python Implementation of paper "Robust Camera Calibration for Sport Videos using Court Models"☆14Nov 15, 2023Updated 2 years ago
- Code for the C2KD paper (ICASSP 2023)☆19May 15, 2023Updated 3 years ago
- 二次元猜猜呗(弗/灯/亚一把)一键自动快速部署☆24May 25, 2025Updated last year
- A Pytorch implementation of TrackNetV2 from Tensorflow (ncnn c++ inference)☆60Nov 3, 2024Updated last year
- [CVPRW 2023] Official repository of "Digital Twin Tracking Dataset (DTTD): A New RGB+Depth 3D Dataset for Longer-Range Object Tracking Ap…☆24Nov 22, 2024Updated last year