[CVPR2024] This is the official implementation of MP5
☆108 Jun 30, 2024 Updated last year
Alternatives and similar repositories for MP5
Users that are interested in MP5 are comparing it to the libraries listed below.
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula… ☆103 Jun 16, 2025 Updated 9 months ago
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment ☆41 Dec 27, 2023 Updated 2 years ago
- ☆52 Feb 8, 2025 Updated last year
- [World-Model-Survey-2024] Paper list and projects for World Model ☆15 Oct 31, 2024 Updated last year
- ☆48 Dec 11, 2023 Updated 2 years ago
- ☆15 Jun 6, 2024 Updated last year
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft ☆205 Jun 4, 2024 Updated last year
- Text world based on Minecraft rules. ☆17 May 13, 2024 Updated last year
- JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models ☆393 Apr 8, 2024 Updated 2 years ago
- ☆11 Oct 25, 2024 Updated last year
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight) ☆67 Dec 18, 2023 Updated 2 years ago
- [ICCV2025] RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation ☆37 Jul 21, 2025 Updated 8 months ago
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety ☆59 Jul 21, 2025 Updated 8 months ago
- Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics" ☆71 Jan 19, 2026 Updated 2 months ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?" ☆94 May 23, 2023 Updated 2 years ago
- Diagnostic Framework for LLMs and MLLMs ☆36 Mar 2, 2026 Updated last month
- ☆11 Jul 11, 2023 Updated 2 years ago
- 【ACL 2024】 SALAD benchmark & MD-Judge ☆173 Mar 8, 2025 Updated last year
- [ICCV 2025] RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints ☆121 Sep 2, 2025 Updated 7 months ago
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct" ☆21 May 15, 2025 Updated 10 months ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change. ☆23 Apr 16, 2022 Updated 3 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context … ☆22 Mar 1, 2024 Updated 2 years ago
- Awesome multi-modal large language paper/project, collections of popular training strategies, e.g., PEFT, LoRA. ☆27 Aug 2, 2024 Updated last year
- A customized docker for headless GPU rendering without host-side configuration ☆11 Aug 22, 2022 Updated 3 years ago
- ☆12 Nov 5, 2024 Updated last year
- [TIP25] Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning" ☆14 May 12, 2025 Updated 11 months ago
- Responsible Robotic Manipulation ☆15 Aug 31, 2025 Updated 7 months ago
- A RLHF Infrastructure for Vision-Language Models ☆198 Nov 15, 2024 Updated last year
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks ☆99 Jun 17, 2025 Updated 9 months ago
- [CVPR 2025] Official Implementation for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy ☆24 Jun 17, 2025 Updated 9 months ago
- A list of awesome and popular robot learning environments ☆117 Aug 17, 2024 Updated last year
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆69 May 31, 2024 Updated last year
- Checkpoint for Voyager, 160 iterations. ☆23 May 27, 2023 Updated 2 years ago
- ☆19 Aug 21, 2024 Updated last year
- We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, le… ☆28 Apr 7, 2025 Updated last year
- HAZARD challenge ☆37 Apr 27, 2025 Updated 11 months ago
- This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥 ☆1,769 Feb 12, 2026 Updated 2 months ago
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information ☆12 Oct 11, 2024 Updated last year
- Code implementation for paper "Can Large Language Models Empower Molecular Property Prediction?" ☆39 Jul 14, 2023 Updated 2 years ago