[CVPR2024] This is the official implement of MP5
☆106Jun 30, 2024Updated last year
Alternatives and similar repositories for MP5
Users that are interested in MP5 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆103Jun 16, 2025Updated 11 months ago
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆41Dec 27, 2023Updated 2 years ago
- ☆53Feb 8, 2025Updated last year
- [World-Model-Survey-2024] Paper list and projects for World Model☆15Oct 31, 2024Updated last year
- ☆49Dec 11, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆15May 21, 2026Updated 3 weeks ago
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft☆211Jun 4, 2024Updated 2 years ago
- Text world based on Minecraft rules.☆17May 13, 2024Updated 2 years ago
- JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models☆397Apr 8, 2024Updated 2 years ago
- ☆11Oct 25, 2024Updated last year
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)☆67Dec 18, 2023Updated 2 years ago
- [NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents☆317Apr 16, 2024Updated 2 years ago
- Paper List of Minecraft Agents☆69May 24, 2026Updated 3 weeks ago
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆62Jul 21, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"☆74Jan 19, 2026Updated 4 months ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆94May 23, 2023Updated 3 years ago
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆22May 15, 2025Updated last year
- [ICCV 2025] RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints☆132Sep 2, 2025Updated 9 months ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change.☆23Apr 16, 2022Updated 4 years ago
- This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…☆165Sep 3, 2024Updated last year
- A customized docker for headless GPU rendering without host-side configuration☆11Aug 22, 2022Updated 3 years ago
- ☆30May 22, 2024Updated 2 years ago
- ☆12Nov 5, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Responsible Robotic Manipulation☆15Aug 31, 2025Updated 9 months ago
- A RLHF Infrastructure for Vision-Language Models☆200Nov 15, 2024Updated last year
- Odyssey: Empowering Minecraft Agents with Open-World Skills☆391Oct 22, 2025Updated 7 months ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆101Jun 17, 2025Updated 11 months ago
- [TIP25] Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning"☆16May 12, 2025Updated last year
- [CVPR 2025] Official Implementation for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy☆25Jun 17, 2025Updated 11 months ago
- ☆17Apr 5, 2023Updated 3 years ago
- A list of awesome and popular robot learning environments☆122Aug 17, 2024Updated last year
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆68May 31, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆19Aug 21, 2024Updated last year
- We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, le…☆31Apr 7, 2025Updated last year
- HAZARD challenge☆38Apr 27, 2025Updated last year
- Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object☆19Dec 1, 2024Updated last year
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆59Aug 24, 2025Updated 9 months ago
- This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥☆1,815Updated this week
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information☆11Oct 11, 2024Updated last year