Safe Option-Critic: Learning Safety in the Option-Critic Architecture
☆20Dec 16, 2018Updated 7 years ago
Alternatives and similar repositories for SafeOptionCritic
Users who are interested in SafeOptionCritic are comparing it to the libraries listed below
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Implementation of the Option-Critic Architecture☆41Dec 9, 2018Updated 7 years ago
- Proximal Policy Option-Critic☆26Jan 4, 2019Updated 7 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent☆14May 25, 2023Updated 2 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Jul 31, 2020Updated 5 years ago
- Implementation of the Option-Critic Architecture on the Atari (ALE) environment☆182Sep 21, 2017Updated 8 years ago
- Scripts for performing experiments on semantic networks generated by a machine learning model.☆12Apr 7, 2019Updated 6 years ago
- Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networ…☆17Jun 25, 2019Updated 6 years ago
- Reimplementation of ToMNet with some extensions for RL as well☆14Apr 28, 2018Updated 7 years ago
- ☆14Nov 21, 2022Updated 3 years ago
- ☆19Apr 22, 2024Updated last year
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- A3C style Option-Critic with deliberation cost☆40Jan 9, 2018Updated 8 years ago
- ☆43Feb 9, 2017Updated 9 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆74Jun 1, 2017Updated 8 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆143Aug 2, 2024Updated last year
- Implementation of the paper "Adaptive Skip Intervals: Temporal Abstraction for Recurrent Dynamical Models"☆24Sep 7, 2018Updated 7 years ago
- SeqGAN but with more bells and whistles☆24Feb 15, 2018Updated 8 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆25May 15, 2019Updated 6 years ago
- ☆32Mar 19, 2024Updated last year
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆28Jun 3, 2023Updated 2 years ago
- Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.☆33Jan 23, 2021Updated 5 years ago
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆36Dec 8, 2022Updated 3 years ago
- This is an implementation of PMG, and I visualize the features abstracted by convs. It's helpful for me to understand the structure of PM…☆11Feb 25, 2021Updated 5 years ago
- Signals and Systems lab☆12Nov 9, 2025Updated 3 months ago
- dynamic planning, hybrid models, hierarchical active inference, tool use☆13Jun 13, 2025Updated 8 months ago
- on-policy optimization baselines for deep reinforcement learning☆32Apr 3, 2020Updated 5 years ago
- ☆38Nov 12, 2017Updated 8 years ago
- [ECCV 2018] code for Choose Your Neuron: Incorporating Domain Knowledge Through Neuron Importance☆57Aug 8, 2018Updated 7 years ago
- Annotated bibliographies.☆40Aug 25, 2019Updated 6 years ago
- Approximate Multiparametric Mixed-integer Convex Programming☆15May 16, 2019Updated 6 years ago
- ☆13Mar 21, 2023Updated 2 years ago
- AWS FastAPI deployment on top of ALB and ECS with Docker containers implementing ECS as the orchestration tool for an AWS-managed infrast…☆10May 22, 2023Updated 2 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- [ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills☆12Jul 15, 2023Updated 2 years ago
- paper on dexpilot☆15Oct 14, 2019Updated 6 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- record and share my reading everyday☆12Apr 1, 2016Updated 9 years ago