Python implementations of the RL algorithms in examples and figures in Sutton & Barto, Reinforcement Learning: An Introduction
☆96Oct 31, 2018Updated 7 years ago
Alternatives and similar repositories for sutton_barto
Users that are interested in sutton_barto are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NHS AI Lab Skunkworks’ project: Data Lens☆17Jul 11, 2022Updated 3 years ago
- An implementation of AlphaZero and MCTS with neural networks for Tetris☆22Mar 21, 2025Updated last year
- Reinforcement Learning examples implementation and explanation☆345Jul 9, 2024Updated last year
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆22Mar 5, 2026Updated 3 weeks ago
- Notes and exercise solutions for second edition of Sutton & Barto's book☆406Oct 2, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- self-studying the Sutton & Barto the hard way☆204Nov 27, 2021Updated 4 years ago
- Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences"☆12May 20, 2019Updated 6 years ago
- Soulbound POAP☆10Sep 1, 2022Updated 3 years ago
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago
- Implementing REINFORCE algorithm on Pong, Lunar Lander and Cartplot + Medium Article☆23Nov 24, 2020Updated 5 years ago
- Companion code to CoRL 2019 paper: E Bıyık, M Palan, NC Landolfi, DP Losey, D Sadigh. "Asking Easy Questions: A User-Friendly Approach to…☆18Oct 13, 2020Updated 5 years ago
- ☆12Dec 8, 2020Updated 5 years ago
- Explanation Optimization☆13Oct 16, 2020Updated 5 years ago
- 0xAA Wallet is a AA (Account Abstraction) wallet focused on developer experience, which helps developers build ERC4337 compatible Dapp.☆11Apr 1, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A ERC1155-based SBT (soulbound token) implementation by WTF Academy☆12Jun 11, 2024Updated last year
- Reward Propagation using Graph Convolutional Networks☆13Jun 19, 2021Updated 4 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Haskell to D3.js binding by deep EDSL approach.☆23Sep 20, 2014Updated 11 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆18Sep 9, 2022Updated 3 years ago
- Gym implementation of connector to Deepmind lab☆12Mar 26, 2019Updated 7 years ago
- SDSC Summer Institute 2018 Teaching Material☆10Nov 25, 2022Updated 3 years ago
- Code repository for the paper "Learning partial differential equations for biological transport models from noisy spatiotemporal data"☆10Jul 3, 2019Updated 6 years ago
- ☆21Dec 17, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆13Jul 25, 2024Updated last year
- ☆24Oct 3, 2025Updated 5 months ago
- Code for the figures in Chapter 13 of "Reinforcement Learning: An Introduction" by Sutton and Barto☆14Jul 6, 2023Updated 2 years ago
- Reinforcement Learning papers on exploration methods.☆19Jun 27, 2021Updated 4 years ago
- A straightforward implementation of the mapper construction by Carlsson-Memoli-Singh. I wrote a little blog post about it at http://blog.…☆15Mar 18, 2015Updated 11 years ago
- Reinforcement Learning to teach a Neato to follow a line.☆10Apr 2, 2017Updated 8 years ago
- ☆17Jan 6, 2024Updated 2 years ago
- Haskell binding for Menoh DNN inference library☆12Nov 30, 2018Updated 7 years ago
- Scripts for running several OpenAI Baselines algorithms on all MuJoCo or Roboschool gym environments to compare performance.☆12Sep 25, 2019Updated 6 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Solving CartPole-v1 environment in Keras with Actor Critic algorithm an Deep Reinforcement Learning algorithm☆12May 19, 2020Updated 5 years ago
- A Clojure library for writing difference equations☆13Jul 15, 2017Updated 8 years ago
- Generic API for dispatch to Pyro backends.☆16Feb 13, 2022Updated 4 years ago
- Implementation of fundamental concepts and algorithms for reinforcement learning☆15May 24, 2020Updated 5 years ago
- AWS virtual infrastructure simulator for training reinforcement learning based cloud capacity management systems☆11Sep 23, 2020Updated 5 years ago
- ☆11Aug 22, 2017Updated 8 years ago