A gym environment for Stuart Armstrong's model of a treacherous turn.
☆18Jul 28, 2018Updated 7 years ago
Alternatives and similar repositories for gym-alttp-gridworld
Users that are interested in gym-alttp-gridworld are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.☆55Dec 6, 2018Updated 7 years ago
- A formalisation of Cartesian Frames, a perspective on embedded agency, in the HOL theorem prover.☆20Dec 20, 2021Updated 4 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61May 13, 2021Updated 4 years ago
- TensorFlow implementation of asynchronous advantage actor-critic (A3C)☆38Oct 20, 2021Updated 4 years ago
- This repo gives an example of using a simple method of reinforcement learning to beat the Lunar Lander environment. The agent uses a comb…☆18Jul 27, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Interpretability dashboard for reinforcement learners☆16Jun 4, 2019Updated 6 years ago
- ☆12Jun 14, 2021Updated 4 years ago
- Unified notation for Markov Decision Processes PO(MDP)s☆24Apr 27, 2018Updated 7 years ago
- Command-line spaced repetition scheduler.☆10Mar 8, 2015Updated 11 years ago
- Function annotations for Hylang!☆11Nov 12, 2014Updated 11 years ago
- Reimplementation of the clockwork recurrent neural network in Torch7☆14Feb 4, 2016Updated 10 years ago
- Recognizing a speaker using Deep Learning☆11Dec 25, 2017Updated 8 years ago
- Internet Chess ToolKit is a java based set of libraries and widgets useful for performing common tasks such as reading PGN, FEN, and gene…☆12Feb 22, 2017Updated 9 years ago
- ☆13Sep 24, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The Tensorflow code and a DeepMind Lab wrapper for my article "Meta-Reinforcement Learning" on FloydHub.☆37Mar 28, 2019Updated 7 years ago
- Baseline models for the paper: "Modeling Naive Psychology of Characters in Simple Commonsense Stories" by Hannah Rashkin, Antoine Bosselu…☆16Feb 23, 2021Updated 5 years ago
- Farcaster-feed is a Farcaster protocol syndication tool for Node.js☆15Sep 28, 2022Updated 3 years ago
- Ack-like search tool written in Rust☆18Mar 14, 2016Updated 10 years ago
- Read, write and manipulate code which reads, writes and manipulates code.☆10Mar 15, 2020Updated 6 years ago
- Implementation of https://medium.com/ai-control/alba-an-explicit-proposal-for-aligned-ai-17a55f60bbcf☆27May 30, 2017Updated 8 years ago
- Training (hopefully) safe agents in gridworlds☆25May 12, 2019Updated 6 years ago
- Rust wrapper for STAR aligner☆20Updated this week
- Spectral Method for Multiple Experts Inverse Reinforcement Learning☆14Sep 6, 2014Updated 11 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆21Mar 14, 2021Updated 5 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)