MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
☆70Nov 10, 2025Updated 6 months ago
Alternatives and similar repositories for MLA
Users that are interested in MLA are comparing it to the libraries listed below.
- The official repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)☆21Jun 25, 2025Updated 10 months ago
- Code repository for the paper "Exploiting Proximity-Aware Tasks for Embodied Social Navigation"☆11Nov 16, 2023Updated 2 years ago
- ☆264Aug 25, 2025Updated 8 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- DemoHLM: From One Demonstration to Generalizable Humanoid Loco-Manipulation☆22Oct 14, 2025Updated 6 months ago
- ☆11Nov 27, 2025Updated 5 months ago
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces☆87Jun 6, 2025Updated 11 months ago
- ViTacFormer: Learning Cross-Modal Representation for Visuo-Tactile Dexterous Manipulation☆95Jul 14, 2025Updated 9 months ago
- Computer Systems Lab☆11Oct 16, 2025Updated 6 months ago
- A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions withi…☆19Apr 23, 2025Updated last year
- The repo for code that hasn't been published yet☆14May 14, 2025Updated 11 months ago
- Responsible Robotic Manipulation☆15Aug 31, 2025Updated 8 months ago
- ☆33Jun 7, 2025Updated 11 months ago
- Official Repo of "D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models"☆79Updated this week
- [AAAI 2026] Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks☆45Nov 24, 2025Updated 5 months ago
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"☆32Dec 12, 2025Updated 4 months ago
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026☆37Nov 24, 2025Updated 5 months ago
- EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AI☆25Jan 17, 2026Updated 3 months ago
- LLaVA-Next for STVG☆19Dec 5, 2025Updated 5 months ago
- This repository contains my MSc dissertation project. It is an implementation of a streaming GMM algorithm in Spark.☆11Aug 25, 2018Updated 7 years ago
- ☆27Dec 20, 2024Updated last year
- [Paper][EMNLP 2025] Enrich-on-Graph: Query-Graph Alignment for Complex Reasoning with LLM Enriching☆34Feb 8, 2026Updated 3 months ago
- Music Modeling Kit☆22Jan 10, 2025Updated last year
- Public implementation of Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling☆29Dec 3, 2025Updated 5 months ago
- ☆11Oct 24, 2017Updated 8 years ago
- A framework bridging cognitive science and LLM reasoning research to diagnose and improve how large language models reason, based on anal…☆38Nov 26, 2025Updated 5 months ago
- ☆25Nov 10, 2025Updated 6 months ago
- ☆17Jul 11, 2025Updated 10 months ago
- Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"☆24Oct 8, 2025Updated 7 months ago
- Segmentation of blood vessel from CTA scan using bone subtraction and an iterative thresholding seeking algorithm☆12Apr 9, 2021Updated 5 years ago
- ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an…☆49Nov 27, 2025Updated 5 months ago
- ☆57Jan 27, 2026Updated 3 months ago
- 3DPhysNet in Tensorflow (IJCAI 2018) https://arxiv.org/abs/1805.00328☆15Jul 2, 2018Updated 7 years ago
- Provides useful controllers for using Franka Emika Panda robots with ros_control☆16Apr 30, 2026Updated last week
- Table top manipulation calibration between the robot arm, the fixed cameras and the camera in hand.☆11Apr 12, 2024Updated 2 years ago
- turn your phone into a robot teleop setup☆32Jun 16, 2025Updated 10 months ago
- ☆11Sep 11, 2018Updated 7 years ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆182Jun 20, 2025Updated 10 months ago
- [2024 ECCV] Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework☆12Jun 13, 2025Updated 10 months ago