[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment
☆41Dec 27, 2023Updated 2 years ago
Alternatives and similar repositories for STEVE
Users that are interested in STEVE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆47Dec 11, 2023Updated 2 years ago
- [CVPR2024] This is the official implement of MP5☆108Jun 30, 2024Updated last year
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- ICCV'23 | Adverse Weather Removal with Codebook Priors☆10Aug 28, 2023Updated 2 years ago
- A Massive Multi-Discipline Lecture Understanding Benchmark☆33Nov 1, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, le…☆27Apr 7, 2025Updated 11 months ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 6 months ago
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark☆141Jun 4, 2025Updated 9 months ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆96Jun 17, 2025Updated 9 months ago
- An experiment with movie scenes and contrastive learning☆11Feb 1, 2025Updated last year
- Paper List of Minecraft Agents☆58Mar 6, 2026Updated 2 weeks ago
- LITEN: Learning from Inference Time Execution for VLAs☆27Oct 23, 2025Updated 5 months ago
- Checkpoint for Voyager, 160 iterations.☆23May 27, 2023Updated 2 years ago
- ☆40Jan 12, 2026Updated 2 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding☆689Jan 29, 2025Updated last year
- [ACM MM 2022] Target-Driven Structured Transformer Planner for Vision-Language Navigation☆16Nov 1, 2022Updated 3 years ago
- [ACM MM 2023] PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation☆12Aug 28, 2023Updated 2 years ago
- Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”☆12Mar 4, 2024Updated 2 years ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆94May 23, 2023Updated 2 years ago
- Audio propagation engine - Meta Reality Labs Research.☆23Nov 1, 2022Updated 3 years ago
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Nov 21, 2024Updated last year
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆35Aug 7, 2025Updated 7 months ago
- Foundation Model for MineDojo☆297Apr 2, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Odyssey: Empowering Minecraft Agents with Open-World Skills☆368Oct 22, 2025Updated 5 months ago
- 🏙️🌆🌃 Try Infinite and Controllable 3D City Layout Generation!☆43Dec 10, 2023Updated 2 years ago
- ☆47Sep 7, 2025Updated 6 months ago
- An Empirical Study of GPT-4o Image Generation Capabilities☆29Apr 16, 2025Updated 11 months ago
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆22Dec 9, 2024Updated last year
- to release the source code for reproducing the results reported in our paper: https://arxiv.org/abs/2409.17550☆14Nov 15, 2024Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆199Mar 6, 2024Updated 2 years ago
- A data processing module implemented with numpy☆10Aug 16, 2022Updated 3 years ago
- Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memo…☆638Jun 5, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for GPT Reviews — a daily AI-generated podcast☆18Aug 14, 2024Updated last year
- Official code for the ICCV2023 paper ``One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training''☆20Aug 9, 2023Updated 2 years ago
- Evaluation codes and data for GenEval2☆60Jan 8, 2026Updated 2 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆46Oct 15, 2025Updated 5 months ago
- FlexiFilm: Long Video Generation with Flexible Conditions☆31May 1, 2024Updated last year
- Topic Model based on Pretrained Sentence Embeddings (with BERT)☆13Feb 8, 2023Updated 3 years ago