umd-huang-lab / WocaR-RLView external linksLinks
Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning
☆28Sep 13, 2023Updated 2 years ago
Alternatives and similar repositories for WocaR-RL
Users that are interested in WocaR-RL are comparing it to the libraries listed below
Sorting:
- This repository contains the official code for our NeurIPS 2021 publication "Robust Deep Reinforcement Learning through Adversarial Loss…☆30Jan 21, 2022Updated 4 years ago
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆139Nov 16, 2021Updated 4 years ago
- [ICML 2024 Oral] Consistent Adversarial Robust Deep Q Networks (CAR-DQN)☆15Feb 27, 2025Updated 11 months ago
- Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework☆67Jan 26, 2021Updated 5 years ago
- Code for "On the Robustness of Safe Reinforcement Learning under Observational Perturbations" (ICLR 2023)☆46Dec 10, 2024Updated last year
- ☆19Jun 15, 2023Updated 2 years ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning☆35Feb 22, 2021Updated 4 years ago
- ☆20Oct 12, 2022Updated 3 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆80May 21, 2023Updated 2 years ago
- Assignments for CS294-112.☆16Jul 13, 2018Updated 7 years ago
- PPO and PyMARL baseline for Pogema environment☆24Sep 18, 2024Updated last year
- ☆24Jan 26, 2024Updated 2 years ago
- ☆23Feb 14, 2025Updated last year
- Implementation of Data Efficient Reinforcement Learning in Pytorch☆20Aug 6, 2019Updated 6 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆53Oct 18, 2021Updated 4 years ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Oct 25, 2022Updated 3 years ago
- Official code repository for Prompt-DT.☆121Aug 3, 2022Updated 3 years ago
- Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]☆24Nov 9, 2024Updated last year
- Implementations of safe reinforcement learning algorithms☆29Mar 1, 2024Updated last year
- Tensorflow implementation for Robust Adversarial Reinforcement Learning: https://arxiv.org/pdf/1703.02702.pdf☆28Mar 7, 2018Updated 7 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Mar 27, 2021Updated 4 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆32Jun 2, 2023Updated 2 years ago
- The official repository of Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation☆25Jun 19, 2024Updated last year
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆36Jul 6, 2022Updated 3 years ago
- Learning to branch with reinforcement learning using retrospective trajectories for exact combinatorial optimisation.☆38Mar 15, 2023Updated 2 years ago
- LEAP is a novel tool for discovering latent temporal causal relations.☆17Oct 18, 2021Updated 4 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- ☆11Jun 1, 2017Updated 8 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- python implementation baseline recommender systems☆11Oct 10, 2018Updated 7 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Dec 7, 2020Updated 5 years ago
- Implementation of Quantile-Constrained Policy Optimization (QCPO)☆11Sep 28, 2022Updated 3 years ago
- Code for SIGKDD2025 paper: An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem☆14Jan 28, 2025Updated last year
- Software and hardware electronic project around controlling flip-dot matrix displays.☆10May 31, 2019Updated 6 years ago
- Repository for my studies of Causal Inference☆10Dec 1, 2019Updated 6 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- DTLC-GAN Tensorflow☆12Aug 29, 2018Updated 7 years ago
- 🏆 SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting☆18Feb 4, 2026Updated last week
- This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DDPG)☆11Sep 14, 2017Updated 8 years ago