Safe Option-Critic: Learning Safety in the Option-Critic Architecture
☆20Dec 16, 2018Updated 7 years ago
Alternatives and similar repositories for SafeOptionCritic
Users that are interested in SafeOptionCritic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Proximal Policy Option-Critic☆26Jan 4, 2019Updated 7 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Jul 31, 2020Updated 5 years ago
- Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networ…☆18Jun 25, 2019Updated 6 years ago
- Implementation of the Option-Critic Architecture on the Atari (ALE) environment☆184Sep 21, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 2 years ago
- Least Squares Policy Iteration (LSPI) in Python☆11May 25, 2015Updated 10 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆74Jun 1, 2017Updated 8 years ago
- A3C style Option-Critic with deliberation cost☆40Jan 9, 2018Updated 8 years ago
- ☆15Nov 21, 2022Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆145Aug 2, 2024Updated last year
- ☆23Nov 9, 2021Updated 4 years ago
- ☆43Feb 9, 2017Updated 9 years ago
- Reimplementation of ToMNet with some extensions for RL as well☆14Apr 28, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Scripts for performing experiments on semantic networks generated by a machine learning model.☆12Apr 7, 2019Updated 7 years ago
- Physics-Informed Reinforcement Learning for Smart V2G EV Charging and Distribution Network Voltage Support.☆33Oct 16, 2025Updated 6 months ago
- ☆19Apr 22, 2024Updated 2 years ago
- Code for generating options for planning and reinforcement learning☆12Feb 18, 2021Updated 5 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- Successor Options is an option discovery framework for Reinforcement Learning☆14Jun 17, 2024Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆42May 16, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is an implementation of PMG, and I visualize the features abstracted by convs. It's helpful for me to understand the structure of PM…☆11Feb 25, 2021Updated 5 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Jun 3, 2021Updated 4 years ago
- Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.☆34Jan 23, 2021Updated 5 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆25May 15, 2019Updated 6 years ago
- Functions for analysing public patenting data.☆16Oct 9, 2018Updated 7 years ago
- ☆13Jan 14, 2020Updated 6 years ago
- [ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills☆12Jul 15, 2023Updated 2 years ago
- ☆33Mar 19, 2024Updated 2 years ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…☆73Jul 17, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 2025年科学院+工程院院士候选人负面网络舆情☆39Sep 6, 2025Updated 7 months ago
- ☆23Aug 22, 2025Updated 8 months ago
- Controlling a mobile robot using a Logitech F710 gamepad☆14Aug 11, 2025Updated 8 months ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆28Jun 3, 2023Updated 2 years ago
- JAX implementation of Graph Attention Networks☆13Jan 29, 2022Updated 4 years ago
- ☆14Jun 11, 2025Updated 10 months ago
- Use tensorflow2 achieve PPO to play atari game☆13Oct 25, 2019Updated 6 years ago