The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
☆89 Jan 16, 2026 Updated 2 months ago
Alternatives and similar repositories for Mantis
Users that are interested in Mantis are comparing it to the libraries listed below.
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding ☆36 Updated this week
- ☆39 May 20, 2025 Updated 10 months ago
- Code for orthogonal neural operator ☆18 Oct 15, 2023 Updated 2 years ago
- The repository provides code for EgoMAN model and dataset creation scripts. ☆30 Dec 31, 2025 Updated 3 months ago
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models ☆20 Oct 2, 2024 Updated last year
- ☆10 Mar 21, 2023 Updated 3 years ago
- Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models ☆194 Jan 4, 2026 Updated 3 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates ☆24 Jul 1, 2025 Updated 9 months ago
- Memory-Dependent Manipulation Benchmark based on RoboTwin ☆94 Mar 30, 2026 Updated last week
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆57 Mar 6, 2025 Updated last year
- ☆16 Mar 26, 2025 Updated last year
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective ☆28 Apr 4, 2024 Updated 2 years ago
- EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video ☆212 Aug 20, 2025 Updated 7 months ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning". ☆32 Feb 26, 2026 Updated last month
- GSVC: Efficient Video Representation and Compression Through 2D Gaussian Splatting (NOSSDAV 2025) ☆31 Aug 15, 2025 Updated 7 months ago
- ☆41 Jun 9, 2025 Updated 10 months ago
- Raspberry Pi biped robot - using python ☆13 May 23, 2015 Updated 10 years ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi… ☆33 Oct 30, 2025 Updated 5 months ago
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation ☆112 Jan 27, 2026 Updated 2 months ago
- [RA-L] DRAGON: A Dialogue-Based Robot for Assistive Navigation with Visual Language Grounding ☆18 Apr 17, 2024 Updated last year
- Code for IROS 2024 paper "AutoNeRF: Training Implicit Scene Representations with Autonomous Agents" ☆17 Oct 24, 2024 Updated last year
- ☆35 Nov 17, 2025 Updated 4 months ago
- Code for Decomposed Vector-Quantized Variational Autoencoder for Human Grasp Generation ☆24 Apr 15, 2025 Updated 11 months ago
- Code for the ATRIAS bipedal robot. Contains the hardware interface, controllers, simulation, and post-processing tools. ☆12 Mar 19, 2015 Updated 11 years ago
- Hands-On Image Processing with Python, Second Edition, Published by Packt ☆27 Mar 17, 2026 Updated 3 weeks ago
- [ECAI 2024] TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement ☆13 Oct 16, 2024 Updated last year
- 🚗🗣️📡🗾 🏁 A framework for navigation tasks that can build the 3D scene graph in real-time and utilize large language model (LLM) to gui… ☆25 Oct 14, 2024 Updated last year
- Test-time Scaling for VAR models ☆31 Sep 19, 2025 Updated 6 months ago
- Official implementation of GR-MG ☆92 Jan 12, 2025 Updated last year
- ☆38 Jan 25, 2026 Updated 2 months ago
- ☆31 Sep 12, 2025 Updated 7 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence. ☆28 Dec 30, 2025 Updated 3 months ago
- More reliable Video Understanding Evaluation ☆15 Sep 23, 2025 Updated 6 months ago
- AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model ☆22 Apr 18, 2025 Updated 11 months ago
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne… ☆54 Dec 20, 2024 Updated last year
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution" ☆127 Feb 14, 2025 Updated last year
- ☆33 Jul 15, 2025 Updated 8 months ago
- Source code for [ECCV2024]O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation ☆24 Mar 23, 2025 Updated last year
- ☆10 Feb 3, 2026 Updated 2 months ago