Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning
☆44Mar 18, 2026Updated this week
Alternatives and similar repositories for Vlaser
Users that are interested in Vlaser are comparing it to the libraries listed below
Sorting:
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation☆107Jan 27, 2026Updated last month
- This repository is the official implementation of our paper (From reactive to cognitive: brain-inspired spatial intelligence for embodied…☆78Nov 6, 2025Updated 4 months ago
- the official implementation of CogNav [ICCV 2025]☆65Sep 24, 2025Updated 5 months ago
- Reward Evolution with Large Language Models using Human Feedback☆18Nov 14, 2025Updated 4 months ago
- VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs☆51Jan 5, 2026Updated 2 months ago
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text☆33Jan 31, 2026Updated last month
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆38Dec 2, 2025Updated 3 months ago
- Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning☆27Jul 4, 2025Updated 8 months ago
- ☆35May 9, 2024Updated last year
- Code of paper "Temporal Consistent Automatic Video Colorization via Semantic Correspondence"☆10Apr 24, 2024Updated last year
- A collection of VLMs papers, blogs, and projects, with a focus on VLMs in Autonomous Driving and related reasoning techniques.☆11Nov 16, 2024Updated last year
- 🦾 A Dual-System VLA with System2 Thinking☆136Aug 21, 2025Updated 6 months ago
- [ICRA 2024]ASGrasp: Generalizable Transparent Object Reconstruction and 6-DoF Grasp Detection from RGB-D Active Stereo Camera☆97Jun 12, 2024Updated last year
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆17Nov 11, 2025Updated 4 months ago
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025)☆43May 25, 2025Updated 9 months ago
- Home page☆21Jan 16, 2026Updated 2 months ago
- Piper based VoiceDock TTS implementation☆11Aug 12, 2023Updated 2 years ago
- Debug DeepSpeed-Chat step by step in IDE (在IDE里一步一步调试DeepSpeed-Chat)☆10Apr 17, 2023Updated 2 years ago
- Control 3f robotiq gripper using python and modbus client☆13Jun 27, 2024Updated last year
- The official repository of the first version of ACE-Brain foundation model.☆62Mar 13, 2026Updated last week
- An extension of the Planner-Actor-Reporter framework applied to autonomous vehicles in Highway-Env and CARLA.☆16Jan 27, 2025Updated last year
- [NeurIPS 2025 Spotlight] ReSim: Reliable World Simulation for Autonomous Driving☆145Jan 2, 2026Updated 2 months ago
- ☆18May 7, 2022Updated 3 years ago
- An implementation of EMMA (End-to-End Multimodal Model for Autonomous Driving) using the Claude API, based on the EMMA paper.☆12Dec 14, 2024Updated last year
- Official Implementation for “CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World” (RSS 2025).☆50Nov 26, 2025Updated 3 months ago
- [ICLR 2026] From Seeing to Experiencing: Scaling Navigation Foundation Models with Reinforcement Learning☆55Feb 13, 2026Updated last month
- ☆16Nov 2, 2016Updated 9 years ago
- 📚 2025 Scene Graph ArXiv Paper List — Updated Daily☆15Feb 25, 2026Updated 3 weeks ago
- Automated detection of exudates from fundus images plays an important role in diabetic retinopathy (DR) screening and evaluation, for whi…☆11Dec 11, 2020Updated 5 years ago
- 本项目综合运用d3、echarts来完成可视化工作,实现了对nba两场比赛的可视化数据分析,包括球员运动轨迹、个人数据、传球次数以及得分位置等多种可交互式图表。通过可视化方法,我们能够进一步深入分析球队的具体情况,便于制定更佳的战术。☆15Dec 19, 2022Updated 3 years ago
- ☆24Oct 31, 2024Updated last year
- This is the official implementation of WiseAD.☆26Apr 22, 2025Updated 10 months ago
- ☆30Sep 11, 2025Updated 6 months ago
- This project utilizes deep reinforcement learning techniques to train a robot, which combines a mobile platform and a Panda robotic arm, …☆10Jun 7, 2023Updated 2 years ago
- FedFR: Joint Optimization Federated Framework for Generic and Personalized Face Recognition (AAAI 2022)☆15Oct 8, 2022Updated 3 years ago
- Yeet 88 agents at a problem and see what survives.☆24Feb 5, 2026Updated last month
- ☆22Oct 27, 2021Updated 4 years ago
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy☆383Feb 11, 2026Updated last month
- FastPoseCNN: Real-time 6D Pose and Size Estimation☆14Jul 6, 2021Updated 4 years ago