MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
☆69Nov 10, 2025Updated 6 months ago
Alternatives and similar repositories for MLA
Users that are interested in MLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The offical repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)☆22Jun 25, 2025Updated 11 months ago
- repository for "Exploiting Proximity-Aware Tasks for Embodied Social Navigation" paper code☆12Nov 16, 2023Updated 2 years ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- ☆11Nov 27, 2025Updated 6 months ago
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces☆86Jun 6, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Computer Systems Lab☆13Oct 16, 2025Updated 7 months ago
- ViTacFormer: Learning Cross-Modal Representation for Visuo-Tactile Dexterous Manipulation☆99May 23, 2026Updated 2 weeks ago
- DemoHLM: From One Demonstration to Generalizable Humanoid Loco-Manipulation☆24Oct 14, 2025Updated 7 months ago
- The repo for code, that hasn't been published yet☆14May 14, 2025Updated last year
- Responsible Robotic Manipulation☆15Aug 31, 2025Updated 9 months ago
- ☆21Dec 6, 2022Updated 3 years ago
- ☆33Jun 7, 2025Updated last year
- [AAAI 2026] Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks☆46Nov 24, 2025Updated 6 months ago
- ☆47May 14, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"☆33Dec 12, 2025Updated 5 months ago
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026☆37Nov 24, 2025Updated 6 months ago
- EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AI☆25Jan 17, 2026Updated 4 months ago
- this project provide a verity of code help you collect data from your robotic arm, have fun!☆232May 6, 2026Updated last month
- Public implementation of Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling☆30Dec 3, 2025Updated 6 months ago
- ☆11Oct 24, 2017Updated 8 years ago
- A framework bridging cognitive science and LLM reasoning research to diagnose and improve how large language models reason, based on anal…☆40Nov 26, 2025Updated 6 months ago
- ☆26Nov 10, 2025Updated 6 months ago
- ☆17Jul 11, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"☆24Oct 8, 2025Updated 8 months ago
- Segmentation of blood vessel from CTA scan using bone subtraction and an iterative thresholding seeking algorithm☆12Apr 9, 2021Updated 5 years ago
- ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an…☆51Nov 27, 2025Updated 6 months ago
- ☆67Jan 27, 2026Updated 4 months ago
- Provides useful controllers for using Franka Emika Panda robots with ros_control☆16May 27, 2026Updated last week
- turn your phone into a robot teleop setup☆32Jun 16, 2025Updated 11 months ago
- 深蓝学院视觉slam课后作业,以及视觉slam14讲☆15May 7, 2020Updated 6 years ago
- Accompanying code for "An Elastic Basis for Spectral Shape Correspondence"☆12Aug 2, 2023Updated 2 years ago
- Code release for "Differential Angular Imaging for Material Recognition", CVPR 2017.☆18Oct 9, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆183Jun 20, 2025Updated 11 months ago
- [2024 ECCV] Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework☆12Jun 13, 2025Updated 11 months ago
- Background resampling for out-of-distribution detection☆13Mar 27, 2020Updated 6 years ago
- Pytorch implementation of "Diversified in-domain synthesis with efficient fine-tuning for few-shot classification"☆17Mar 25, 2024Updated 2 years ago
- Habitat ROS is a ROS 1 package for robot simulation in habitat-sim providing customizable robotic sensors (2D Laser, 3D Lidar, RGBD camer…☆41Jun 29, 2024Updated last year
- Incremental and Adaptative Gaussian Mixture Model Library is an implementation of a machine learning algorithms based on GMM. The algorit…☆13Mar 23, 2020Updated 6 years ago
- ☆11Mar 24, 2023Updated 3 years ago