wenhaochai/STEVE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wenhaochai/STEVE)

wenhaochai / STEVE

[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment

☆41

Alternatives and similar repositories for STEVE

Users that are interested in STEVE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wenhaochai / UniAP
View on GitHub
[AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning
☆12Dec 10, 2023Updated 2 years ago
wenhaochai / PoseDA
View on GitHub
[ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation
☆24Aug 26, 2023Updated 2 years ago
wenhaochai / claude-plugins
View on GitHub
Personal Claude Code plugin marketplace
☆16Jul 21, 2026Updated last week
IranQin / MP5
View on GitHub
[CVPR2024] This is the official implement of MP5
☆105Jun 30, 2024Updated 2 years ago
Owen718 / LongPrompt-LLamaGen
View on GitHub
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…
☆30Oct 21, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Espere-1119-Song / Video-MMLU
View on GitHub
A Massive Multi-Discipline Lecture Understanding Benchmark
☆34Apr 20, 2026Updated 3 months ago
Rem105-210 / DiffFashion
View on GitHub
☆82Mar 15, 2023Updated 3 years ago
TIGER-AI-Lab / PixelWorld
View on GitHub
The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
☆15Sep 12, 2025Updated 10 months ago
wenhaochai / aurora
View on GitHub
[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark
☆147Jun 4, 2025Updated last year
OpenCausaLab / ADAM
View on GitHub
We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, le…
☆33Apr 7, 2025Updated last year
iLearn-Lab / NeurIPS24-Optimus-1
View on GitHub
[NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
☆102Jun 17, 2025Updated last year
GXYM / VCapsBench
View on GitHub
VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation
☆20Jun 2, 2025Updated last year
kj3moraes / movieclip
View on GitHub
An experiment with movie scenes and contrastive learning
☆11Feb 1, 2025Updated last year
hbing-l / PoSynDA
View on GitHub
[ACM MM 2023] PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation
☆12Aug 28, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
daswer123 / Voyager_checkpoint
View on GitHub
Checkpoint for Voyager, 160 iterations.
☆23May 27, 2023Updated 3 years ago
YushengZhao / TD-STP
View on GitHub
[ACM MM 2022] Target-Driven Structured Transformer Planner for Vision-Language Navigation
☆16Nov 1, 2022Updated 3 years ago
tomchen-ctj / CVPR23-LOVEU-AQTC
View on GitHub
【CVPRW'23】First Place Solution to the CVPR'2023 AQTC Challenge
☆15Jul 18, 2023Updated 3 years ago
Video-MAC / VideoMAC
View on GitHub
Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”
☆16May 12, 2026Updated 2 months ago
DeckardAgent / deckard
View on GitHub
Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"
☆94May 23, 2023Updated 3 years ago
facebookresearch / rlr-audio-propagation
View on GitHub
Audio propagation engine - Meta Reality Labs Research.
☆24Nov 1, 2022Updated 3 years ago
xie-lab-ml / Meissonic-Inference
View on GitHub
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
☆16Nov 21, 2024Updated last year
chenjinyubuaa / SEvol
View on GitHub
☆14Sep 21, 2022Updated 3 years ago
TAU-VAILab / ProtoSnap
View on GitHub
☆41Jan 12, 2026Updated 6 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zju-vipa / Odyssey
View on GitHub
Odyssey: Empowering Minecraft Agents with Open-World Skills
☆397Oct 22, 2025Updated 9 months ago
wenhaochai / CityGen
View on GitHub
🏙️🌆🌃 Try Infinite and Controllable 3D City Layout Generation!
☆44Dec 10, 2023Updated 2 years ago
collovlabs / ViewControl
View on GitHub
[IJCAI 2024] Official implementation of the paper "Integrating View Conditions for Image Synthesis"
☆25Aug 27, 2024Updated last year
mattdeitke / objaverse-xl-test-files
View on GitHub
☆12Sep 11, 2023Updated 2 years ago
PKU-RL / Plan4MC
View on GitHub
[NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks
☆200Mar 6, 2024Updated 2 years ago
ameesh-shah / liten-vla
View on GitHub
LITEN: Learning from Inference Time Execution for VLAs
☆27Oct 23, 2025Updated 9 months ago
OpenGVLab / GITM
View on GitHub
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memo…
☆642Jun 5, 2023Updated 3 years ago
twelvelabs-io / pegasus-1-eval
View on GitHub
Repository for evaluating Pegasus-1 and video-language foundation models
☆14Nov 12, 2024Updated last year
Meshcapade / mc-uefn
View on GitHub
Meshcapade support for Unreal Editor for Fortnite (UEFN)
☆22Apr 17, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
showlab / T2VScore
View on GitHub
T2VScore: Towards A Better Metric for Text-to-Video Generation
☆81Apr 10, 2024Updated 2 years ago
Y-ichen / FlexiFilm
View on GitHub
FlexiFilm: Long Video Generation with Flexible Conditions
☆31May 1, 2024Updated 2 years ago
Cities2Modding / MOOB
View on GitHub
Map Optimization and Ongoing Bug-fixing mod for Cities Skylines 2
☆10Dec 1, 2023Updated 2 years ago
JohnTailor / BertSenClu
View on GitHub
Topic Model based on Pretrained Sentence Embeddings (with BERT)
☆13Feb 8, 2023Updated 3 years ago
shengyuhao / DIVOTrack
View on GitHub
A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes (IJCV 2024)
☆100Nov 13, 2025Updated 8 months ago
elated-sawyer / WALL-E
View on GitHub
Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents
☆64Dec 3, 2025Updated 7 months ago
Arking1995 / COHO
View on GitHub
[ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation
☆13Aug 13, 2024Updated last year