rese1f/STEVE

[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment

☆41

Alternatives and similar repositories for STEVE

Users who are interested in STEVE are comparing it to the repositories listed below.

PKU-RL / Creative-Agents
☆48 · Dec 11, 2023 · Updated 2 years ago
wenhaochai / PoseDA
[ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation
☆24 · Aug 26, 2023 · Updated 2 years ago
rese1f / UniAP
[AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning
☆12 · Dec 10, 2023 · Updated 2 years ago
IranQin / MP5
[CVPR 2024] This is the official implementation of MP5
☆108 · Jun 30, 2024 · Updated last year
Owen718 / LongPrompt-LLamaGen
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…
☆30 · Oct 21, 2024 · Updated last year
Owen718 / AWRCP
ICCV'23 | Adverse Weather Removal with Codebook Priors
☆10 · Aug 28, 2023 · Updated 2 years ago
OpenCausaLab / ADAM
We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, le…
☆28 · Apr 7, 2025 · Updated last year
Zhoues / MineDreamer
[IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…
☆103 · Jun 16, 2025 · Updated 9 months ago
Rem105-210 / DiffFashion
☆81 · Mar 15, 2023 · Updated 3 years ago
TIGER-AI-Lab / PixelWorld
The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
☆16 · Sep 12, 2025 · Updated 7 months ago
rese1f / aurora
[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark
☆141 · Jun 4, 2025 · Updated 10 months ago
GXYM / VCapsBench
VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation
☆17 · Jun 2, 2025 · Updated 10 months ago
CraftJarvis / MC-TextWorld
Text world based on Minecraft rules.
☆17 · May 13, 2024 · Updated last year
kj3moraes / movieclip
An experiment with movie scenes and contrastive learning
☆11 · Feb 1, 2025 · Updated last year
lizaijing / Awesome-Minecraft-Agent
Paper List of Minecraft Agents
☆60 · Mar 6, 2026 · Updated last month
daswer123 / Voyager_checkpoint
Checkpoint for Voyager, 160 iterations.
☆23 · May 27, 2023 · Updated 2 years ago
TAU-VAILab / ProtoSnap
☆41 · Jan 12, 2026 · Updated 3 months ago
rese1f / MovieChat
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
☆690 · Jan 29, 2025 · Updated last year
YushengZhao / TD-STP
[ACM MM 2022] Target-Driven Structured Transformer Planner for Vision-Language Navigation
☆16 · Nov 1, 2022 · Updated 3 years ago
Video-MAC / VideoMAC
Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”
☆12 · Mar 4, 2024 · Updated 2 years ago
DanielLin97 / FACT-AUDIT
An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models
☆15 · Feb 27, 2025 · Updated last year
tomchen-ctj / CVPR23-LOVEU-AQTC
【CVPRW'23】First Place Solution to the CVPR'2023 AQTC Challenge
☆15 · Jul 18, 2023 · Updated 2 years ago
Shalev-Lifshitz / STEVE-1
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
☆205 · Jun 4, 2024 · Updated last year
xie-lab-ml / Meissonic-Inference
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
☆16 · Nov 21, 2024 · Updated last year
zju-vipa / Odyssey
Odyssey: Empowering Minecraft Agents with Open-World Skills
☆374 · Oct 22, 2025 · Updated 5 months ago
rese1f / CityGen
🏙️🌆🌃 Try Infinite and Controllable 3D City Layout Generation!
☆43 · Dec 10, 2023 · Updated 2 years ago
showlab / Q2A
[ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant
☆23 · Jan 30, 2026 · Updated 2 months ago
teamcraft-bench / teamcraft
☆48 · Sep 7, 2025 · Updated 7 months ago
collovlabs / ViewControl
[IJCAI 2024] Official implementation of the paper "Integrating View Conditions for Image Synthesis"
☆25 · Aug 27, 2024 · Updated last year
neu-vi / FleVRS
FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024
☆22 · Dec 9, 2024 · Updated last year
mattdeitke / objaverse-xl-test-files
☆12 · Sep 11, 2023 · Updated 2 years ago
PKU-RL / Plan4MC
[NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks
☆200 · Mar 6, 2024 · Updated 2 years ago
jiachenlei / maskdm
☆22 · Nov 9, 2023 · Updated 2 years ago
OpenGVLab / GITM
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memo…
☆638 · Jun 5, 2023 · Updated 2 years ago
carefree0910 / carefree-data
A data processing module implemented with numpy
☆10 · Aug 16, 2022 · Updated 3 years ago
Meshcapade / mc-uefn
Meshcapade support for Unreal Editor for Fortnite (UEFN)
☆22 · Apr 17, 2024 · Updated last year
showlab / T2VScore
T2VScore: Towards A Better Metric for Text-to-Video Generation
☆81 · Apr 10, 2024 · Updated 2 years ago
elated-sawyer / WALL-E
Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents
☆58 · Dec 3, 2025 · Updated 4 months ago
Y-ichen / FlexiFilm
FlexiFilm: Long Video Generation with Flexible Conditions
☆31 · May 1, 2024 · Updated last year