Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.
☆10Nov 13, 2017Updated 8 years ago
Alternatives and similar repositories for intrinsic-fear-dqn
Users that are interested in intrinsic-fear-dqn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 6 years ago
- TensorFlow implementation of Deep RL (Reinforcement Learning) papers based on deep Q-learning (DQN)☆10Mar 1, 2018Updated 8 years ago
- ☆11Feb 11, 2020Updated 6 years ago
- 2019 Fall - Game theory and Multi-agent RL Termproject☆10Dec 13, 2019Updated 6 years ago
- A3C tensorflow implementation☆11Jul 22, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Reinforcement Learning program that looks to be able to quickly learn to solve a Rubik's Cube☆15Jun 22, 2021Updated 4 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26May 5, 2020Updated 6 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets☆17Jan 15, 2022Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 6 years ago
- ☆20Jul 16, 2019Updated 6 years ago
- Asynchronous Advantage Actor Critic☆20Aug 15, 2016Updated 9 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning☆13Aug 16, 2019Updated 6 years ago
- Twitter follower graphs of @Die_Gruenen & @AfD, including cluster and topic analysis☆10Jul 10, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions", (NeurIPS'20)☆10Dec 3, 2025Updated 5 months ago
- scala multi-dimensional arrays with reverse-mode autodifferentiation☆18Nov 10, 2017Updated 8 years ago
- An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.☆29Oct 18, 2017Updated 8 years ago
- ☆17Dec 4, 2019Updated 6 years ago
- A helper package to get information of scholarly articles from DBLP using its public API☆16May 13, 2025Updated 11 months ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- Probabilistic planning in continuous state-action MDPs in TensorFlow.☆13Jun 21, 2022Updated 3 years ago
- A simple Gridworld environment for Open AI gym☆25Jun 10, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- allennlp + streamlit demo☆21Oct 2, 2019Updated 6 years ago
- A general implementation of a FFT, FIR and IIR filters and some other General Functions in a TMS320C5535 ezdsp including FFT and FIR, IIR…☆21Apr 20, 2016Updated 10 years ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- The Winning Solution for the Learning To Run Challenge 2017☆60Jul 4, 2018Updated 7 years ago
- A tour of Pomdpland☆10Aug 10, 2022Updated 3 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆81Apr 28, 2019Updated 7 years ago
- ☆14May 30, 2019Updated 6 years ago
- Soccer toy example simulator used in Reinforcement Learning☆12Mar 11, 2018Updated 8 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
- A C# wrapper for the WORLD vocoder☆24Jun 21, 2021Updated 4 years ago
- Summaries and minimal implementations of ML / statistics research articles.☆39Feb 23, 2021Updated 5 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆29Dec 28, 2017Updated 8 years ago
- A web page to collect reproduced papers in one place with their codes☆14Mar 8, 2023Updated 3 years ago
- The interface between probabilistic model checking and data-driven policy learning.☆19Apr 21, 2026Updated 2 weeks ago