mtrazzi / gym-alttp-gridworld

A gym environment for Stuart Armstrong's model of a treacherous turn.

☆18

Alternatives and similar repositories for gym-alttp-gridworld

Users who are interested in gym-alttp-gridworld are comparing it to the repositories listed below.

jvmncs / safe-grid-agents
Training (hopefully) safe agents in gridworlds
☆25Updated 6 years ago
paulfchristiano / alba
Implementation of https://medium.com/ai-control/alba-an-explicit-proposal-for-aligned-ai-17a55f60bbcf
☆27Updated 8 years ago
agentmodels / agentmodels.org
Modeling agents with probabilistic programs
☆67Updated 5 years ago
mnielsen / rmnist
Some examples trained on very reduced versions of the MNIST training set
☆47Updated 7 years ago
PartnershipOnAI / safelife
SafeLife: safety benchmarks for reinforcement learning agents
☆60Updated 4 years ago
matthiasplappert / keras-rl-weights
Trained models for keras-rl.
☆21Updated 8 years ago
johnswentworth / tracelang
Read, write and manipulate code which reads, writes and manipulates code.
☆10Updated 5 years ago
aslanides / aixijs
AIXIjs - General Reinforcement Learning in the Browser
☆148Updated 4 years ago
hardmaru / gecco-tutorial-2019
2019 talk at GECCO
☆68Updated 6 years ago
timvieira / rl
Reference implementation of algorithms for reinforcement learning and Markov decision processes.
☆12Updated 4 years ago
pmlg / deep-rl-bootcamp
Deep RL Bootcamp solutions
☆35Updated 7 years ago
uber-research / Synthetic-Petri-Dish
☆42Updated 5 years ago
51alg / TerpreT
☆43Updated 7 years ago
dojoteef / glas
Generative Latent Attentive Sampler
☆26Updated 8 years ago
alok / rl_implementations
Reinforcement learning algorithm implementations and ML experimentation workspace
☆43Updated 6 years ago
andrewschreiber / agent
Interpretability dashboard for reinforcement learners
☆16Updated 6 years ago
CarsonScott / OpenAgent
An agent library for systems of nested automata.
☆43Updated 8 years ago
mrahtz / gym-moving-dot
A simple moving dot environment for OpenAI Gym to test reinforcement learning algorithms
☆23Updated 2 years ago
colah / NN-Topology-Post
A blog post exploring a connection between neural networks and topology
☆101Updated 6 years ago
instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
☆79Updated last year
willieneis / ProBO
ProBO: Versatile Bayesian Optimization Using Any Probabilistic Programming Language
☆15Updated 6 years ago
alexander-turner / attainable-utility-preservation
☆12Updated 4 years ago
CarsonScott / Knowledge-Discovery-Agents
A Goal-Oriented Approach to Knowledge Discovery in Multi-Agent Systems
☆43Updated 8 years ago
awjuliani / RL-CC
Web-based Reinforcement Learning Control Center
☆64Updated 8 years ago
oughtinc / patchwork
Command-line recursive question-answering with immutable contexts and explicit data store
☆26Updated 6 years ago
attentionagent / attentionagent.github.io
Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)
☆21Updated 3 years ago
moridinamael / mc-aixi
MC-AIXI-CTW by Marcus Hutter and his students (in particular Daniel Visentin)
☆49Updated 14 years ago
matejbalog / gumbel-relatives
Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick
☆17Updated 8 years ago
ofnote / tsanley
A runtime shape checker and auto-annotator for tensor programs (pronounced "stanley")
☆40Updated 5 years ago
CW-Huang / BayesianHypernet
☆17Updated 7 years ago