rese1f / STEVEView external linksLinks
[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment
☆41Dec 27, 2023Updated 2 years ago
Alternatives and similar repositories for STEVE
Users that are interested in STEVE are comparing it to the libraries listed below
Sorting:
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- ICCV'23 | Adverse Weather Removal with Codebook Priors☆10Aug 28, 2023Updated 2 years ago
- [AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning☆12Dec 10, 2023Updated 2 years ago
- We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, le…☆26Apr 7, 2025Updated 10 months ago
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark☆138Jun 4, 2025Updated 8 months ago
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆102Jun 16, 2025Updated 7 months ago
- [ACM MM 2023] PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation☆12Aug 28, 2023Updated 2 years ago
- 【CVPRW'23】First Place Solution to the CVPR'2023 AQTC Challenge☆15Jul 18, 2023Updated 2 years ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 5 months ago
- ☆39Jan 12, 2026Updated last month
- Text world based on Minecraft rules.☆17May 13, 2024Updated last year
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Nov 21, 2024Updated last year
- Meshcapade support for Unreal Editor for Fortnite (UEFN)☆22Apr 17, 2024Updated last year
- 🏙️🌆🌃 Try Infinite and Controllable 3D City Layout Generation!☆44Dec 10, 2023Updated 2 years ago
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆22Dec 9, 2024Updated last year
- [ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant☆22Jan 30, 2026Updated 2 weeks ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆94Jun 17, 2025Updated 7 months ago
- ☆23Nov 9, 2023Updated 2 years ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆52Dec 28, 2025Updated last month
- Evaluation codes and data for GenEval2☆55Jan 8, 2026Updated last month
- Official code for the ICCV2023 paper ``One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training''☆20Aug 9, 2023Updated 2 years ago
- [AAAI‘ 2025 ] "AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement".☆67Mar 9, 2025Updated 11 months ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆94May 23, 2023Updated 2 years ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- An Empirical Study of GPT-4o Image Generation Capabilities☆29Apr 16, 2025Updated 9 months ago
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆43Oct 15, 2025Updated 3 months ago
- AAAI 2024: VQCNIR: Clearer Night Image Restoration with Vector-Quantized Codebook☆28Dec 24, 2023Updated 2 years ago
- Checkpoint for Voyager, 160 iterations.☆23May 27, 2023Updated 2 years ago
- Actually released!☆10Feb 24, 2021Updated 4 years ago
- RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark (ECCV 2024)☆31Sep 10, 2024Updated last year
- Odyssey: Empowering Minecraft Agents with Open-World Skills☆365Oct 22, 2025Updated 3 months ago
- A curated list of awesome DUST3R/MAST3R related papers.☆35Aug 5, 2025Updated 6 months ago
- 📚 A collection of resources and papers on Large Language Models in autonomous driving☆27Oct 30, 2023Updated 2 years ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆35May 8, 2025Updated 9 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆286Dec 4, 2024Updated last year
- ☆37Nov 8, 2024Updated last year
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆36Oct 3, 2024Updated last year
- ☆37Jun 23, 2025Updated 7 months ago
- ☆45Sep 7, 2025Updated 5 months ago