The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
☆90 · Jan 16, 2026 · Updated 3 months ago
Alternatives and similar repositories for Mantis
Users who are interested in Mantis are comparing it to the repositories listed below.
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding ☆37 · Apr 25, 2026 · Updated last week
- ☆39 · May 20, 2025 · Updated 11 months ago
- The repository provides code for EgoMAN model and dataset creation scripts. ☆31 · Dec 31, 2025 · Updated 4 months ago
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models ☆20 · Oct 2, 2024 · Updated last year
- ☆10 · Mar 21, 2023 · Updated 3 years ago
- Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models ☆199 · Jan 4, 2026 · Updated 3 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆57 · Mar 6, 2025 · Updated last year
- Memory-Dependent Manipulation Benchmark based on RoboTwin ☆111 · Apr 19, 2026 · Updated last week
- ☆16 · Mar 26, 2025 · Updated last year
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective ☆28 · Apr 4, 2024 · Updated 2 years ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning". ☆32 · Feb 26, 2026 · Updated 2 months ago
- GSVC: Efficient Video Representation and Compression Through 2D Gaussian Splatting (NOSSDAV 2025) ☆34 · Aug 15, 2025 · Updated 8 months ago
- ☆42 · Jun 9, 2025 · Updated 10 months ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi… ☆33 · Oct 30, 2025 · Updated 6 months ago
- Project Page of Paper "Drive in Corridors: Enhancing the Safety of End-to-end Autonomous Driving via Corridor Learning and Planning" ☆29 · May 8, 2025 · Updated 11 months ago
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers ☆40 · Jul 23, 2025 · Updated 9 months ago
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation ☆112 · Jan 27, 2026 · Updated 3 months ago
- Code for IROS 2024 paper "AutoNeRF: Training Implicit Scene Representations with Autonomous Agents" ☆17 · Oct 24, 2024 · Updated last year
- ☆35 · Nov 17, 2025 · Updated 5 months ago
- Test Realtime FIR/IIR Filter using FMAC (Filter Math Accelerator). The FMAC unit is built around a fixed point multiplier and accumulato… ☆13 · Nov 10, 2021 · Updated 4 years ago
- [AAAI 2026] Official implementation of paper "UrbanNav: Learning Language-Guided Embodied Urban Navigation from Web-Scale Human Trajector… ☆59 · Mar 27, 2026 · Updated last month
- Code for Decomposed Vector-Quantized Variational Autoencoder for Human Grasp Generation ☆24 · Apr 15, 2025 · Updated last year
- ☆14 · Oct 11, 2023 · Updated 2 years ago
- This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit… ☆64 · Apr 8, 2026 · Updated 3 weeks ago
- Test-time Scaling for VAR models ☆31 · Sep 19, 2025 · Updated 7 months ago
- Official implementation of GR-MG ☆91 · Jan 12, 2025 · Updated last year
- Hands-On Image Processing with Python, Second Edition, Published by Packt ☆29 · Updated this week
- Adding confidence to the SPIN mesh. ☆13 · Jun 25, 2023 · Updated 2 years ago
- ☆44 · Jan 30, 2026 · Updated 3 months ago
- ☆31 · Sep 12, 2025 · Updated 7 months ago
- An MCP Task Server ☆11 · Mar 7, 2025 · Updated last year
- More reliable Video Understanding Evaluation ☆15 · Sep 23, 2025 · Updated 7 months ago
- AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model ☆22 · Apr 18, 2025 · Updated last year
- ☆41 · Jan 25, 2026 · Updated 3 months ago
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne… ☆54 · Dec 20, 2024 · Updated last year
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution" ☆125 · Feb 14, 2025 · Updated last year
- ☆33 · Jul 15, 2025 · Updated 9 months ago
- Source code for [ECCV2024] O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation ☆24 · Mar 23, 2025 · Updated last year
- ☆12 · Jun 11, 2025 · Updated 10 months ago