[NeurIPS'2025] "OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis"
☆28Dec 4, 2025Updated 3 months ago
Alternatives and similar repositories for OWMM-Agent
Users that are interested in OWMM-Agent are comparing it to the libraries listed below
Sorting:
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation☆33Feb 23, 2026Updated 2 weeks ago
- ☆57Feb 2, 2026Updated last month
- ☆32Feb 3, 2026Updated last month
- [AAAI 2024]Weakly Supervised Multimodal Affordance Grounding for Egocentric Images☆13Nov 10, 2024Updated last year
- ☆10Dec 6, 2019Updated 6 years ago
- ☆37Dec 18, 2025Updated 2 months ago
- ☆24Feb 2, 2026Updated last month
- Official implementation for “SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain”☆21Dec 11, 2025Updated 2 months ago
- ☆15Jun 14, 2025Updated 8 months ago
- code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)☆10Jul 15, 2022Updated 3 years ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- This repository is the official implementation of Low-Rank Modular Reinforcement Learning via Muscle Synergy.☆11Oct 27, 2022Updated 3 years ago
- Official repository for GraphEQA☆22Sep 25, 2025Updated 5 months ago
- Code listing for the paper 'SATAR: A Self-supervised Approach to Twitter Account Representation Learning and its Application in Bot Detec…☆10Nov 1, 2021Updated 4 years ago
- ☆32Sep 19, 2025Updated 5 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆88Jun 10, 2025Updated 8 months ago
- ☆11Oct 16, 2020Updated 5 years ago
- A small storytelling LLM running on the PS Vita☆27Jun 12, 2025Updated 8 months ago
- Official implementation of NeurIPS 2022 paper "Learning Active Camera for Multi-Object Navigation"☆10Apr 23, 2023Updated 2 years ago
- Official Implementation of "Low-Frequency First: Eliminating Floating Artifacts in 3D Gaussian Splatting" (CW2025 Best Paper Honorable Me…☆23Oct 19, 2025Updated 4 months ago
- Official Implementation of "Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach"☆29Dec 3, 2025Updated 3 months ago
- ☆11Jul 16, 2024Updated last year
- [ICCV 2025] LIRA☆21Nov 25, 2025Updated 3 months ago
- A C++ ROS simulator for reactive social robot navigation based on Enzymatic Numerical P systems.☆12Feb 23, 2021Updated 5 years ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 7 months ago
- The project repository for paper EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents: https://arxiv.org/abs…☆63Jan 6, 2025Updated last year
- Official implementation of "LoFA: Learning to Predict Personalized Prior for Fast Adaptation of Visual Generative Models".☆35Feb 1, 2026Updated last month
- ☆19Dec 20, 2025Updated 2 months ago
- [ICRA 2026] 🌠 DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation☆29Jan 14, 2026Updated last month
- [ACL2023] Official code repository for VLN-Trans☆14Sep 10, 2023Updated 2 years ago
- ☆17Mar 2, 2026Updated last week
- Where is this IP?☆14Feb 24, 2024Updated 2 years ago
- code for the paper "ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts" (CVPR 2022)☆10Jul 17, 2022Updated 3 years ago
- Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions☆18May 1, 2025Updated 10 months ago
- CNN For Fish Training☆27Jul 9, 2025Updated 8 months ago
- Noah -- fixing your computer issues☆47Updated this week
- [ECCV 2022] Official pytorch implementation of the paper "FedVLN: Privacy-preserving Federated Vision-and-Language Navigation"☆13Oct 8, 2022Updated 3 years ago
- Official Repository for the ACM MM 2024 paper "Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments"☆15May 16, 2025Updated 9 months ago
- [NeurIPS 2025] Official Implementation of ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding.☆47Jan 28, 2026Updated last month