mtrazzi / gym-alttp-gridworld
A gym environment for Stuart Armstrong's model of a treacherous turn.
☆17Updated 6 years ago
Alternatives and similar repositories for gym-alttp-gridworld:
Users that are interested in gym-alttp-gridworld are comparing it to the libraries listed below
- Implementation of https://medium.com/ai-control/alba-an-explicit-proposal-for-aligned-ai-17a55f60bbcf☆27Updated 7 years ago
- Trained models for keras-rl.☆21Updated 8 years ago
- ☆43Updated 5 years ago
- Training (hopefully) safe agents in gridworlds☆25Updated 5 years ago
- Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.☆54Updated 6 years ago
- Generative Latent Attentive Sampler☆26Updated 7 years ago
- imperative programming in TensorFlow☆18Updated 8 years ago
- Interpretability dashboard for reinforcement learners☆16Updated 5 years ago
- Sample code for generative recurrent autoencoders.☆25Updated 8 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆60Updated 3 years ago
- A Google Chrome extension that notifies you when longer running Google Colaboraty cells are finished☆23Updated 5 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Updated 4 years ago
- ☆29Updated 6 years ago
- Easy genetic algorithm☆14Updated 7 years ago
- Reinforcement learning algorithms☆40Updated 6 years ago
- A simple moving dot environment for OpenAI Gym to test reinforcement learning algorithms☆23Updated 2 years ago
- Replication of Uber Neuroevolution paper☆46Updated 7 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 7 years ago
- Some hard problems for reinforcement learning.☆32Updated 6 years ago
- Collection of tutorials, exercises and papers on RL☆17Updated 7 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Updated 6 years ago
- A probabilistic programming language, based on Church☆17Updated 7 years ago
- Web-based Reinforcement Learning Control Center☆64Updated 8 years ago
- Some examples trained on very reduced versions of the MNIST training set☆47Updated 7 years ago
- Read, write and manipulate code which reads, writes and manipulates code.☆10Updated 5 years ago
- Neuroevolution as a direct policy search deep reinforcement learning method, implemented using Keras and DEAP.☆70Updated 4 years ago
- presentations☆44Updated 6 years ago
- Markov Decision Processes in Python☆15Updated 6 years ago
- Analogs of Linguistic Structure in Deep Representations☆19Updated 7 years ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Updated 7 years ago