mtrazzi / gym-alttp-gridworldLinks
A gym environment for Stuart Armstrong's model of a treacherous turn.
☆18Updated 7 years ago
Alternatives and similar repositories for gym-alttp-gridworld
Users that are interested in gym-alttp-gridworld are comparing it to the libraries listed below
Sorting:
- SafeLife: safety benchmarks for reinforcement learning agents☆61Updated 4 years ago
- Implementation of https://medium.com/ai-control/alba-an-explicit-proposal-for-aligned-ai-17a55f60bbcf☆27Updated 8 years ago
- Modeling agents with probabilistic programs☆67Updated 6 years ago
- Trained models for keras-rl.☆21Updated 9 years ago
- Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.☆55Updated 7 years ago
- Training (hopefully) safe agents in gridworlds☆25Updated 6 years ago
- Collection of tutorials, exercises and papers on RL☆17Updated 8 years ago
- Deep RL Bootcamp solutions☆34Updated 8 years ago
- AIXIjs - General Reinforcement Learning in the Browser☆148Updated 5 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated 2 years ago
- ☆29Updated 7 years ago
- Command-line recursive question-answering with immutable contexts and explicit data store☆26Updated 7 years ago
- 2019 talk at GECCO☆68Updated 6 years ago
- A blog post exploring a connection between neural networks and topology☆102Updated 7 years ago
- Read, write and manipulate code which reads, writes and manipulates code.☆10Updated 5 years ago
- Interpretability dashboard for reinforcement learners☆16Updated 6 years ago
- presentations☆44Updated 7 years ago
- Web-based Reinforcement Learning Control Center☆65Updated 9 years ago
- ☆42Updated 8 years ago
- Practice for coding interviews.☆36Updated 6 years ago
- Reinforcement learning algorithms☆41Updated 6 years ago
- A probabilistic programming language, based on Church☆17Updated 8 years ago
- Algorithmic Intelligence Quotient☆39Updated 3 years ago
- An agent library for systems of nested automata.☆43Updated 8 years ago
- ☆11Updated 4 years ago
- MC-AIXI-CTW by Marcus Hutter and his students (in particular Daniel Visentin)☆49Updated 14 years ago
- Scripts to generate a dataset with static frames from the Arcade Learning Environment☆19Updated 11 years ago
- Some examples trained on very reduced versions of the MNIST training set☆47Updated 8 years ago
- Python implementation of tabular asynchronous actor critic☆11Updated 9 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Updated 7 years ago