[NeurIPS 2020 Spotlight] State-adversarial PPO for robust deep reinforcement learning
☆30Nov 18, 2021Updated 4 years ago
Alternatives and similar repositories for SA_PPO
Users that are interested in SA_PPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning☆28Sep 13, 2023Updated 2 years ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning☆35Feb 22, 2021Updated 5 years ago
- Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework☆68Jan 26, 2021Updated 5 years ago
- Code for "On the Robustness of Safe Reinforcement Learning under Observational Perturbations" (ICLR 2023)☆46Dec 10, 2024Updated last year
- [ICLR 2024] DMBP: Diffusion Model-Based Predictor for Robust Offline Reinforcement Learning against State Observations Perturbations.☆17May 24, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Oct 3, 2023Updated 2 years ago
- Reference implementations for RecurJac, CROWN, FastLin and FastLip (Neural Network verification and robustness certification algorithms)…☆27Nov 23, 2019Updated 6 years ago
- Code for the NeurIPS 2023 Paper: Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Sta…☆30Oct 29, 2023Updated 2 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- PPO and PyMARL baseline for Pogema environment☆24Sep 18, 2024Updated last year
- Average-Reward Reinforcement Learning with Trust Region Methods☆11Oct 17, 2022Updated 3 years ago
- code for ROMANCE☆14Oct 12, 2024Updated last year
- Code for ICLR 2022 publication: Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL. https://openreview…☆10Aug 31, 2024Updated last year
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Single file implementations of Deep Multi-agent Reinforcement Learning☆65Feb 12, 2026Updated last month
- ☆10May 30, 2025Updated 9 months ago
- Using PPO algorithm for collision avoidance training of unmanned vehicles in Gazebo☆25Dec 15, 2025Updated 3 months ago
- TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning☆11Oct 18, 2022Updated 3 years ago
- ☆10Jun 22, 2021Updated 4 years ago
- [ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆91Mar 20, 2026Updated last week
- Code of On L-p Robustness of Decision Stumps and Trees, ICML 2020☆10Aug 3, 2020Updated 5 years ago
- This repository contains the official code for our NeurIPS 2021 publication "Robust Deep Reinforcement Learning through Adversarial Loss…☆33Jan 21, 2022Updated 4 years ago
- Distributional Soft Actor Critic☆61Jun 6, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆48Apr 14, 2019Updated 6 years ago
- The implementation of Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System.☆11Sep 8, 2025Updated 6 months ago
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"☆54Jan 26, 2024Updated 2 years ago
- Privacy-preserving Voice Analysis via Disentangled Representations☆11Aug 30, 2021Updated 4 years ago
- Connecting Interpretability and Robustness in Decision Trees through Separation☆17May 8, 2021Updated 4 years ago
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- implementation for "learning weighted deterministic automata from queries and counterexamples", neurips 2019☆18Jan 8, 2020Updated 6 years ago
- ☆13Jul 9, 2018Updated 7 years ago
- Code for optimal execution☆12Oct 29, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- ☆14Dec 8, 2025Updated 3 months ago
- Transient Stability Analysis of Networked Microgrids Using Rapid Neural Lyapunov Method☆14Sep 13, 2023Updated 2 years ago
- Tufts Probabilistic Robotics Spring 2020 Final Project☆17May 7, 2020Updated 5 years ago
- Robustness for Non-Parametric Classification: A Generic Attack and Defense☆18Nov 21, 2022Updated 3 years ago
- Official Code for Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation (CVPR 2025)☆13Apr 2, 2025Updated 11 months ago
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13May 2, 2024Updated last year