📖Learning reinforcement learning by implementing the algorithms from reinforcement learning an introduction
☆84Mar 8, 2026Updated 2 months ago
Alternatives and similar repositories for sutton-barto-rl-exercises
Users that are interested in sutton-barto-rl-exercises are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Solutions and figures for problems from Reinforcement Learning: An Introduction Sutton&Barto☆20Jul 16, 2019Updated 6 years ago
- Implementations for solutions to programming exercises of Reinforcement Learning: An Introduction, Second Edition (Sutton & Barto)☆33Jun 23, 2022Updated 3 years ago
- Hands On Reinforcement Learning with Python[Video], Published by Packt☆13Jan 14, 2021Updated 5 years ago
- Notes and exercise solutions for second edition of Sutton & Barto's book☆405Oct 2, 2022Updated 3 years ago
- Bayesian Network with R and Hadoop☆23Jun 6, 2014Updated 11 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Python implementations of the RL algorithms in examples and figures in Sutton & Barto, Reinforcement Learning: An Introduction☆97Oct 31, 2018Updated 7 years ago
- Price options by fitting a Lévy distribution☆10Jan 20, 2021Updated 5 years ago
- This repository contains the code used to run generate the data splits, run the hyperparameter tunings, and export the results presented …☆14Jul 22, 2022Updated 3 years ago
- Your best resource to learn mixed-integer programming to solve practical decision-making problems.☆26Feb 18, 2025Updated last year
- ☆12Jun 7, 2018Updated 7 years ago
- Python Implementation of Reinforcement Learning: An Introduction☆14,640Aug 9, 2024Updated last year
- I write some notes about the time I try to ac the leetcode problems.☆12Mar 10, 2020Updated 6 years ago
- Deep Q-Networks in tensorflow☆10Apr 4, 2017Updated 9 years ago
- Jetson nano trials and playground for robotics.☆16Oct 6, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A command line tool to query source code from your current Python env☆17Apr 6, 2026Updated last month
- Reinforcement Learning for Cut Selection☆12Dec 8, 2022Updated 3 years ago
- Stream Data based News Recommendation - Contextual Bandit Approach☆47Nov 15, 2017Updated 8 years ago
- Ensemble/Blender example in R using Caret (companion code for YouTube video: https://www.youtube.com/watch?v=k7sTiTWWCXM)☆11Sep 19, 2014Updated 11 years ago
- Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"☆19Oct 25, 2018Updated 7 years ago
- A quadcopter simulator in matlab.☆17Oct 1, 2019Updated 6 years ago
- Minimal implementations of reinforcement learning algorithms by Tensorflow☆29Nov 29, 2017Updated 8 years ago
- The MIP Workshop 2023 Computational Competition☆39Feb 2, 2024Updated 2 years ago
- A startup search engine made using embeddings built on crunchbase company descriptions☆11Dec 2, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆28Nov 28, 2021Updated 4 years ago
- Generate text and predict next word for an initial piece of text using RNNs and LSTMs☆11Jun 27, 2017Updated 8 years ago
- Links to Machine Learning Blogs☆12Feb 29, 2020Updated 6 years ago
- The simplex algorithm, implemented in Cuda and for CPU (ECE1782 project)☆17Jun 29, 2020Updated 5 years ago
- Solves the Vehicle Routing Problem (VRP) using Column Generation (CG). It is made as an inspiration to use CG in more projects, since it …☆10Nov 2, 2022Updated 3 years ago
- Python library for Multi-Armed Bandits☆769Feb 11, 2020Updated 6 years ago
- 2D toy datasetを用いたRealNVPの非常に簡単な実例です。ライブラリはPyTorchを用いていま す。☆12Dec 29, 2018Updated 7 years ago
- Deep Q-Network (DQN) to play classic Atari Games☆11Sep 18, 2017Updated 8 years ago
- Code for the figures in Chapter 13 of "Reinforcement Learning: An Introduction" by Sutton and Barto☆14Jul 6, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge☆18Nov 10, 2017Updated 8 years ago
- The code of paper "Nonlinear Hybrid Planning with Deep Net Learned Transition Models and Mixed-Integer Linear Programming." published on …☆10Apr 27, 2018Updated 8 years ago
- Regression modeling of sub-distribution functions in competing risks☆14Aug 2, 2023Updated 2 years ago
- (ARCHIVED - use IDAES/examples) Example Python code, Jupyter Notebooks, and other files for the IDAES PSE☆20Apr 6, 2023Updated 3 years ago
- Text generation for the Shakespeare model☆13Apr 26, 2017Updated 9 years ago
- ☆15May 31, 2017Updated 8 years ago
- Python + Numpy + Scipy Implementation of LARS and LASSO☆12Oct 19, 2010Updated 15 years ago