Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning
☆45Mar 18, 2026Updated 3 weeks ago
Alternatives and similar repositories for Vlaser
Users that are interested in Vlaser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation☆109Jan 27, 2026Updated 2 months ago
- This repository is the official implementation of our paper (From reactive to cognitive: brain-inspired spatial intelligence for embodied…☆83Nov 6, 2025Updated 5 months ago
- the official implementation of CogNav [ICCV 2025]☆71Sep 24, 2025Updated 6 months ago
- Reward Evolution with Large Language Models using Human Feedback☆18Nov 14, 2025Updated 4 months ago
- VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs☆51Jan 5, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text☆35Jan 31, 2026Updated 2 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆39Dec 2, 2025Updated 4 months ago
- Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning☆27Jul 4, 2025Updated 9 months ago
- Code of paper "Temporal Consistent Automatic Video Colorization via Semantic Correspondence"☆10Apr 24, 2024Updated last year
- ☆35May 9, 2024Updated last year
- A collection of VLMs papers, blogs, and projects, with a focus on VLMs in Autonomous Driving and related reasoning techniques.☆11Nov 16, 2024Updated last year
- 🦾 A Dual-System VLA with System2 Thinking☆138Aug 21, 2025Updated 7 months ago
- [ICRA 2024]ASGrasp: Generalizable Transparent Object Reconstruction and 6-DoF Grasp Detection from RGB-D Active Stereo Camera☆98Jun 12, 2024Updated last year
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆19Nov 11, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025)☆43May 25, 2025Updated 10 months ago
- Home page☆21Jan 16, 2026Updated 2 months ago
- Piper based VoiceDock TTS implementation☆11Aug 12, 2023Updated 2 years ago
- Debug DeepSpeed-Chat step by step in IDE (在IDE里一步一步调试DeepSpeed-Chat)☆10Apr 17, 2023Updated 2 years ago
- Control 3f robotiq gripper using python and modbus client☆13Jun 27, 2024Updated last year
- An extension of the Planner-Actor-Reporter framework applied to autonomous vehicles in Highway-Env and CARLA.☆16Jan 27, 2025Updated last year
- [NeurIPS 2025 Spotlight] ReSim: Reliable World Simulation for Autonomous Driving☆147Jan 2, 2026Updated 3 months ago
- Official Implementation for “CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World” (RSS 2025).☆49Nov 26, 2025Updated 4 months ago
- An implementation of EMMA (End-to-End Multimodal Model for Autonomous Driving) using the Claude API, based on the EMMA paper.☆12Dec 14, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ICLR 2026] From Seeing to Experiencing: Scaling Navigation Foundation Models with Reinforcement Learning☆57Feb 13, 2026Updated last month
- ☆18May 7, 2022Updated 3 years ago
- ☆16Nov 2, 2016Updated 9 years ago
- 📚 2025 Scene Graph ArXiv Paper List — Updated Daily☆16Mar 18, 2026Updated 3 weeks ago
- Automated detection of exudates from fundus images plays an important role in diabetic retinopathy (DR) screening and evaluation, for whi…☆11Dec 11, 2020Updated 5 years ago
- 本项目综合运用d3、echarts来完成可视化工作,实现了对nba两场比赛的可视化数据分析,包括球员运动轨迹、个人数据、传球次数以及得分位置等多种可交互式图表。通过可视化方法,我们能够进一步深入分析球队的具体情况,便于制定更佳的战术。☆15Dec 19, 2022Updated 3 years ago
- ☆24Oct 31, 2024Updated last year
- ☆31Mar 24, 2026Updated 2 weeks ago
- This project utilizes deep reinforcement learning techniques to train a robot, which combines a mobile platform and a Panda robotic arm, …☆10Jun 7, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This is the official implementation of WiseAD.☆26Apr 22, 2025Updated 11 months ago
- FedFR: Joint Optimization Federated Framework for Generic and Personalized Face Recognition (AAAI 2022)☆15Oct 8, 2022Updated 3 years ago
- Yeet 88 agents at a problem and see what survives.☆24Feb 5, 2026Updated 2 months ago
- ☆22Oct 27, 2021Updated 4 years ago
- FastPoseCNN: Real-time 6D Pose and Size Estimation☆14Jul 6, 2021Updated 4 years ago
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy☆393Feb 11, 2026Updated last month
- Retinal lesions segmentation using CNNs and adversarial training: A Degree Thesis Submitted to the Faculty of Escola Tècnica d’Enginyeria…☆10Oct 27, 2019Updated 6 years ago