PKU-Alignment/SafeVLA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PKU-Alignment/SafeVLA)

PKU-Alignment / SafeVLA

[NeurIPS 2025 Spotlight] Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning.

☆153

Alternatives and similar repositories for SafeVLA

Users that are interested in SafeVLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

THU-RCSCT / vlsa-aegis
View on GitHub
A vision-language-safety action architecture, named AEGIS, which contains a plug-and-play safety constraint layer formulated via control …
☆115Jun 30, 2026Updated 3 weeks ago
vla-safe / SAFE
View on GitHub
This is the official repository for "SAFE: Multitask Failure Detection for Vision-Language-Action Models" (NeurIPS 2025)
☆86May 21, 2026Updated 2 months ago
kodenii / Responsible-Robotic-Manipulation
View on GitHub
Responsible Robotic Manipulation
☆16Aug 31, 2025Updated 10 months ago
William-wAng618 / roboticAttack
View on GitHub
Official repo of Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
☆81Mar 26, 2026Updated 3 months ago
shengyin1224 / SafeAgentBench
View on GitHub
Codes for paper "SafeAgentBench: A Benchmark for Safe Task Planning of \\ Embodied LLM Agents"
☆74Feb 25, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
eliotjones1 / robogcg
View on GitHub
Official GitHub repository for the paper "Adversarial Attacks on Robotic Vision Language Action Models"
☆35May 28, 2025Updated last year
UCSB-AI / MSSBench
View on GitHub
[ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"
☆36Jun 23, 2025Updated last year
SpatialVLA / SpatialVLA
View on GitHub
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆707Jun 23, 2025Updated last year
PKU-Alignment / VLA-Arena
View on GitHub
VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models.
☆191Jul 5, 2026Updated 2 weeks ago
asimov-benchmark / code
View on GitHub
☆27Mar 11, 2025Updated last year
Zxy-MLlab / BadVLA
View on GitHub
The official repository for paper: BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization
☆55Dec 9, 2025Updated 7 months ago
CWRU-AISM / action-atlas
View on GitHub
Mechanistic Interpretability toolkit for Vision-Language-Action models
☆20Jul 8, 2026Updated last week
GuanxingLu / vlarl
View on GitHub
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
☆445Nov 8, 2025Updated 8 months ago
Koorye / PCD
View on GitHub
[ICLR 2026] Official implemetation of the paper "Policy Contrastive Decoding for Robotic Foundation Models"
☆28Mar 5, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
PKU-HMI-Lab / Hybrid-VLA
View on GitHub
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
☆352Oct 3, 2025Updated 9 months ago
LiQiiiii / Awesome-VLA-Safety
View on GitHub
[Arxiv] Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms
☆125Jul 13, 2026Updated last week
PKU-Alignment / eval-anything
View on GitHub
☆22Jul 26, 2025Updated 11 months ago
cau-hai-lab / LIBERO-Para
View on GitHub
Official code for "LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models" (arXiv 2603.28301).
☆41Jun 27, 2026Updated 3 weeks ago
JiahengHu / FLaRe
View on GitHub
[ICRA 25] FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning
☆49Jan 5, 2025Updated last year
NVlabs / AHA
View on GitHub
A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation
☆71Apr 1, 2025Updated last year
Koorye / Inspire
View on GitHub
[ICRA 2026] Official implemetation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"
☆51Feb 2, 2026Updated 5 months ago
irom-princeton / byovla
View on GitHub
Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024
☆39Jan 22, 2025Updated last year
x-zheng16 / Awesome-Embodied-AI-Safety
View on GitHub
Safety in Embodied AI: A Survey of Risks, Attacks, and Defenses | 500+ Papers | Perception, Cognition, Planning, Interaction, Agentic Sys…
☆118Updated this week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
itsvaibhav01 / Immune
View on GitHub
[CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
☆28Jun 11, 2025Updated last year
moojink / openvla-oft
View on GitHub
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
☆1,303Sep 9, 2025Updated 10 months ago
AI45Lab / IS-Bench
View on GitHub
[AAAI 2026] Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
☆47Nov 24, 2025Updated 7 months ago
Rookie143 / BadRobot
View on GitHub
This is the official repository for the ICLR 2025 accepted paper Badrobot: Manipulating Embodied LLMs in the Physical World.
☆46Jun 11, 2026Updated last month
OpenDriveLab / UniVLA
View on GitHub
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
☆1,110Nov 19, 2025Updated 8 months ago
openvla / openvla
View on GitHub
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆6,661Mar 23, 2025Updated last year
AI45Lab / VLSBench
View on GitHub
[ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety
☆62Jul 21, 2025Updated last year
OpenHelix-Team / ReconVLA
View on GitHub
Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.
☆270Apr 1, 2026Updated 3 months ago
RUCAIBox / HADES
View on GitHub
[ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …
☆39Oct 23, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CMU-IntentLab / UNISafe
View on GitHub
Uncertainty-aware Latent Safety Filters for Avoiding Out-of-Distribution Failures (CoRL 2025)
☆30Jun 23, 2026Updated 3 weeks ago
shihao1895 / MemoryVLA
View on GitHub
[ICLR 2026] Code of "MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation"
☆305Jun 13, 2026Updated last month
InternRobotics / InstructVLA
View on GitHub
[ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation
☆116Jan 27, 2026Updated 5 months ago
yuffish / rebot
View on GitHub
[IROS 2025] ReBot: Scaling Robot Learning with Real-to-Sim-to-Real Robotic Video Synthesis
☆26May 17, 2025Updated last year
snu-mllab / Bayesian-Red-Teaming
View on GitHub
About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)
☆15Jul 9, 2023Updated 3 years ago
trustmlyoungscientist / EDPA_attack_defense
View on GitHub
Model-agnostic Adversarial Attack and Defense for Vision-Language-Action Models
☆18Dec 12, 2025Updated 7 months ago
PKU-Alignment / SafeDreamer
View on GitHub
ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models
☆105Apr 8, 2024Updated 2 years ago