mtrazzi / gym-alttp-gridworld
A gym environment for Stuart Armstrong's model of a treacherous turn.
☆17Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for gym-alttp-gridworld
- Training (hopefully) safe agents in gridworlds☆25Updated 5 years ago
- Trained models for keras-rl.☆21Updated 8 years ago
- Implementation of https://medium.com/ai-control/alba-an-explicit-proposal-for-aligned-ai-17a55f60bbcf☆27Updated 7 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆59Updated 3 years ago
- Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.☆54Updated 5 years ago
- Collection of tutorials, exercises and papers on RL☆17Updated 7 years ago
- Modeling agents with probabilistic programs☆66Updated 5 years ago
- Reinforcement learning in TensorFlow 2☆22Updated 2 years ago
- A simple moving dot environment for OpenAI Gym to test reinforcement learning algorithms☆22Updated 2 years ago
- Interpretability dashboard for reinforcement learners☆16Updated 5 years ago
- Command-line recursive question-answering with immutable contexts and explicit data store☆24Updated 6 years ago
- ☆42Updated 7 years ago
- presentations☆44Updated 5 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 6 years ago
- Cellular automaton-based calculus for the masses☆24Updated 6 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 2 years ago
- This is the template for the gym agent.☆11Updated 6 years ago
- OpenAI's GPT2 integrated with slack.☆40Updated 5 years ago
- sketch-rnn demo for seoul mediacity biennale 2018☆13Updated 6 years ago
- Installation scripts for CUDA, cuDNN, TensorFlow, Caffe, etc. on Ubuntu machines☆24Updated 3 years ago
- Read, write and manipulate code which reads, writes and manipulates code.☆10Updated 4 years ago
- NAIL is an agent that plays text-based interactive fiction games.☆45Updated last year
- ☆30Updated 6 years ago
- Some hard problems for reinforcement learning.☆32Updated 6 years ago
- Code synthesis with Reinforcement learning☆9Updated 5 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆78Updated last year
- ☆20Updated 5 years ago
- A probabilistic programming language, based on Church☆17Updated 7 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆15Updated 5 years ago