zackchase/intrinsic-fear-dqn

zackchase / intrinsic-fear-dqn

Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.

☆10

Alternatives and similar repositories for intrinsic-fear-dqn

Users who are interested in intrinsic-fear-dqn are comparing it to the repositories listed below.

rems75 / SPIBB-DQN
View on GitHub
Code for SPIBB-DQN and Soft-SPIBB-DQN
☆11 · May 5, 2020 · Updated 6 years ago
ZidanMusk / deep-RL-DQN-tensorflow
View on GitHub
TensorFlow implementation of Deep RL (Reinforcement Learning) papers based on deep Q-learning (DQN)
☆10 · Mar 1, 2018 · Updated 8 years ago
shihao1007 / twitter_streaming_app
View on GitHub
☆11 · Feb 11, 2020 · Updated 6 years ago
Junyoungpark / GTMARL-SC2ENV
View on GitHub
2019 Fall - Game theory and Multi-agent RL Term project
☆10 · Dec 13, 2019 · Updated 6 years ago
SentientOrange / Rubiks-Cube
View on GitHub
Reinforcement Learning program that looks to be able to quickly learn to solve a Rubik's Cube
☆15 · Jun 22, 2021 · Updated 5 years ago
yuishihara / A3C-tensorflow
View on GitHub
A3C tensorflow implementation
☆11 · Jul 22, 2018 · Updated 8 years ago
RomainLaroche / SPIBB
View on GitHub
Safe Policy Improvement with Baseline Bootstrapping
☆26 · May 5, 2020 · Updated 6 years ago
MIT-REALM / dcrl
View on GitHub
Density Constrained Reinforcement Learning
☆12 · Mar 24, 2023 · Updated 3 years ago
davidkerkkamp / DQN-GNN
View on GitHub
Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets
☆17 · Jan 15, 2022 · Updated 4 years ago
ermongroup / CalibratedModelBasedRL
View on GitHub
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆54 · May 15, 2019 · Updated 7 years ago
sebastianrisi / ga-world-models
View on GitHub
☆20 · Jul 16, 2019 · Updated 7 years ago
siemanko / a3c
View on GitHub
Asynchronous Advantage Actor Critic
☆20 · Aug 15, 2016 · Updated 9 years ago
LukasSchaefer / MSc_Curiosity_MARL
View on GitHub
MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning
☆13 · Aug 16, 2019 · Updated 6 years ago
strumswell / twitter-follower-graph
View on GitHub
Twitter follower graphs of @Die_Gruenen & @AfD, including cluster and topic analysis
☆10 · Jul 10, 2020 · Updated 6 years ago
Gouet / QMIX-Starcraft
View on GitHub
☆17 · Dec 4, 2019 · Updated 6 years ago
LAVA-LAB / safe-slac
View on GitHub
Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.
☆11 · Mar 1, 2023 · Updated 3 years ago
abbyvansoest / maxent
View on GitHub
☆14 · May 30, 2019 · Updated 7 years ago
AdityaMate / collapsing_bandits
View on GitHub
Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions", (NeurIPS'20)
☆11 · Dec 3, 2025 · Updated 7 months ago
avibryant / nda
View on GitHub
scala multi-dimensional arrays with reverse-mode autodifferentiation
☆18 · Nov 10, 2017 · Updated 8 years ago
MatheusMRFM / A3C-LSTM-with-Tensorflow
View on GitHub
An implementation of the A3C deep reinforcement learning method using an LSTM layer. Created with TensorFlow.
☆29 · Oct 18, 2017 · Updated 8 years ago
jeappen / gym-grid
View on GitHub
A simple Gridworld environment for OpenAI Gym
☆25 · Jun 10, 2018 · Updated 8 years ago
rjagerman / wsdm2019-nonstationary
View on GitHub
Non-stationary Off-policy Evaluation
☆13 · Nov 8, 2018 · Updated 7 years ago
thiagopbueno / tf-mdp
View on GitHub
Probabilistic planning in continuous state-action MDPs in TensorFlow.
☆13 · Jun 21, 2022 · Updated 4 years ago
joelgrus / streamlit-allennlp
View on GitHub
allennlp + streamlit demo
☆21 · Oct 2, 2019 · Updated 6 years ago
nikonikolov / rltf
View on GitHub
Reinforcement Learning implementations and research prototyping in TensorFlow
☆81 · Apr 28, 2019 · Updated 7 years ago
nnaisense / 2017-learning-to-run
View on GitHub
The Winning Solution for the Learning To Run Challenge 2017
☆60 · Jul 4, 2018 · Updated 8 years ago
joaodias / DSP-Digital-Signal-Processing-
View on GitHub
A general implementation of a FFT, FIR and IIR filters and some other General Functions in a TMS320C5535 ezdsp including FFT and FIR, IIR…
☆21 · Apr 20, 2016 · Updated 10 years ago
roop-pal / Meta-Learning-for-StarCraft-II-Minigames
View on GitHub
We reproduced DeepMind's results and implemented a meta-learning (MLSH) agent which can generalize across minigames.
☆29 · Mar 30, 2021 · Updated 5 years ago
miryoosefi / ConRL
View on GitHub
Constrained episodic reinforcement learning in concave-convex and knapsack settings
☆11 · Oct 3, 2023 · Updated 2 years ago
sisl / pomdpland
View on GitHub
A tour of Pomdpland
☆10 · Aug 10, 2022 · Updated 3 years ago
ievron / RegularizationAnimation
View on GitHub
☆11 · Dec 27, 2021 · Updated 4 years ago
pdvelez / ml_soccer
View on GitHub
Soccer toy example simulator used in Reinforcement Learning
☆12 · Mar 11, 2018 · Updated 8 years ago
orrivlin / MountainCar_DQN_RND
View on GitHub
Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)
☆41 · Jan 28, 2019 · Updated 7 years ago
dtak / POPCORN-POMDP
View on GitHub
Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)
☆11 · May 19, 2021 · Updated 5 years ago
aqtq314 / World.NET
View on GitHub
A C# wrapper for the WORLD vocoder
☆24 · Jun 21, 2021 · Updated 5 years ago
ming93 / Safe_reinforcement_learning
View on GitHub
Convergent Policy Optimization for Safe Reinforcement Learning
☆11 · Oct 26, 2019 · Updated 6 years ago
dtak / hip-mdp-public
View on GitHub
Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning
☆29 · Dec 28, 2017 · Updated 8 years ago
CVLab-TUDelft / reproduced-papers
View on GitHub
A web page to collect reproduced papers in one place with their codes
☆14 · Mar 8, 2023 · Updated 3 years ago
tu-rbo / learning-state-representations-with-robotic-priors
View on GitHub
Code and data accompanying the paper "Learning State Representations with Robotic Priors" (Jonschkowski and Brock, 2015).
☆18 · Nov 14, 2017 · Updated 8 years ago