iLearn-Lab/NeurIPS24-Optimus-1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/iLearn-Lab/NeurIPS24-Optimus-1)

iLearn-Lab / NeurIPS24-Optimus-1

[NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

☆102

Alternatives and similar repositories for NeurIPS24-Optimus-1

Users that are interested in NeurIPS24-Optimus-1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

iLearn-Lab / CVPR25-Optimus-2
View on GitHub
[CVPR 2025] Official Implementation for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy
☆27Jun 17, 2025Updated last year
iLearn-Lab / ICML24-RoboMP2
View on GitHub
[ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language…
☆12Apr 4, 2026Updated 3 months ago
iLearn-Lab / MM2023-FGKVMemPred_video
View on GitHub
Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)
☆23Jul 11, 2024Updated 2 years ago
CraftJarvis / ROCKET-1
View on GitHub
Official implementation of paper "ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting" (CVPR'25)
☆47Apr 13, 2025Updated last year
elated-sawyer / WALL-E
View on GitHub
Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents
☆63Dec 3, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
JiuTian-VL / SimpAgent
View on GitHub
[ICCV 2025 Highlight] Less is More: Empowering GUI Agent with Context-Aware Simplification
☆48Mar 12, 2026Updated 4 months ago
Shalev-Lifshitz / STEVE-1
View on GitHub
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
☆214Jun 4, 2024Updated 2 years ago
CraftJarvis / GROOT
View on GitHub
GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)
☆71Dec 18, 2023Updated 2 years ago
XuRui314 / GLM4v-Finetune
View on GitHub
Support finetuning GLM4v with zero2
☆16Jun 29, 2024Updated 2 years ago
iLearn-Lab / CVPR26-OptimusVLA
View on GitHub
[CVPR 2026] Official Implementation for Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Effi…
☆25Updated this week
BAAI-Agents / GPA-LM
View on GitHub
This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Met…
☆164Sep 3, 2024Updated last year
CraftJarvis / JARVIS-1
View on GitHub
JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models
☆406Apr 8, 2024Updated 2 years ago
CraftJarvis / OmniJARVIS
View on GitHub
☆30Jun 25, 2024Updated 2 years ago
WangWenhao0716 / PDF-Embedding
View on GitHub
[NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"
☆18Oct 1, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
PKU-RL / Creative-Agents
View on GitHub
☆50Dec 11, 2023Updated 2 years ago
OpenCausaLab / ADAM
View on GitHub
We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, le…
☆33Apr 7, 2025Updated last year
MetabrainAGI / Awaker2.5-VL
View on GitHub
☆35Jan 21, 2025Updated last year
Zhoues / MineDreamer
View on GitHub
[IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…
☆103Jun 16, 2025Updated last year
OpenGVLab / GUI-Odyssey
View on GitHub
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159Jan 3, 2026Updated 6 months ago
mindagent / mindagent
View on GitHub
☆102Jun 12, 2024Updated 2 years ago
IranQin / MP5
View on GitHub
[CVPR2024] This is the official implement of MP5
☆105Jun 30, 2024Updated 2 years ago
thu-coai / SPaR
View on GitHub
☆47Jun 11, 2025Updated last year
xieyuquanxx / awesome-Large-MultiModal-Hallucination
View on GitHub
😎 curated list of awesome LMM hallucinations papers, methods & resources.
☆150Mar 23, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
YihongDong / RL-PLUS
View on GitHub
☆27Aug 31, 2025Updated 10 months ago
GasolSun36 / SURf
View on GitHub
[EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information
☆11Oct 11, 2024Updated last year
RenzeLou / Muffin
View on GitHub
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
☆16Oct 31, 2024Updated last year
amazon-science / PAE
View on GitHub
☆70Mar 6, 2025Updated last year
silence143 / EMMOE
View on GitHub
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
☆28May 15, 2025Updated last year
zju-vipa / Odyssey
View on GitHub
Odyssey: Empowering Minecraft Agents with Open-World Skills
☆396Oct 22, 2025Updated 8 months ago
GenEx-world / genex
View on GitHub
Generative World Explorer
☆167Jun 14, 2025Updated last year
iLearn-Lab / ACL26-PersonalAlign
View on GitHub
[ACL 2026 main] PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records
☆21Apr 11, 2026Updated 3 months ago
Raj-08 / Q-Flow
View on GitHub
Complete Reinforcement Learning Toolkit for Large Language Models!
☆21Aug 2, 2025Updated 11 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
allenai / clin
View on GitHub
☆89Dec 15, 2023Updated 2 years ago
CraftJarvis / OpenHA
View on GitHub
Repo for Paper "OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft"
☆41Jun 5, 2026Updated last month
hulianyuyy / iLLaVA
View on GitHub
iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)
☆23Jun 24, 2026Updated 3 weeks ago
MraDonkey / rethinking_prompting
View on GitHub
[ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…
☆18Aug 15, 2025Updated 11 months ago
CraftJarvis / MC-TextWorld
View on GitHub
Text world based on Minecraft rules.
☆18May 13, 2024Updated 2 years ago
cszzx / GRAIN
View on GitHub
[CVPR 2022 Oral] Faithful Extreme Rescaling via Generative Prior Reciprocated Invertible Representations
☆13Jul 14, 2022Updated 4 years ago
LutingWang / HEAD
View on GitHub
HEtero-Assists Distillation for Heterogeneous Object Detectors
☆10Jul 3, 2023Updated 3 years ago