Implementation of Reinforcement Learning algorithms in Python, based on Sutton's & Barto's Book (Ed. 2)
☆159May 3, 2020Updated 5 years ago
Alternatives and similar repositories for Reinforcement-Learning
Users that are interested in Reinforcement-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]☆155Jan 14, 2021Updated 5 years ago
- Notes and exercise solutions for second edition of Sutton & Barto's book☆405Oct 2, 2022Updated 3 years ago
- 📖Learning reinforcement learning by implementing the algorithms from reinforcement learning an introduction☆84Mar 8, 2026Updated last month
- Implementations for solutions to programming exercises of Reinforcement Learning: An Introduction, Second Edition (Sutton & Barto)☆33Jun 23, 2022Updated 3 years ago
- Using NLP and reinforcement learning to build an AI capable of playing text-based games☆28Apr 1, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This project contains several Deep Reinforcement Learning method and some experiments basd on OpenAi gym.☆19Jan 28, 2018Updated 8 years ago
- Berkeley CS285 2019 homework solution☆30Mar 24, 2023Updated 3 years ago
- Solutions to exercises in Reinforcement Learning: An Introduction (2nd Edition).☆404Jul 24, 2023Updated 2 years ago
- Deep Q-Network (DQN) to play classic Atari Games☆11Sep 18, 2017Updated 8 years ago
- Winning solution of the Microsoft Research "First TextWorld Problems: A Reinforcement and Language Learning Challenge"☆12Jun 21, 2022Updated 3 years ago
- Gradient descent algorithms for LQG control☆14Feb 20, 2022Updated 4 years ago
- Python Implementation of Reinforcement Learning: An Introduction☆14,608Aug 9, 2024Updated last year
- Reinforcement based gain calculation for a tracking LQR using actor-critic method☆24Mar 27, 2021Updated 5 years ago
- Collection of presentation of my work on various platforms and meetups☆22Feb 2, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Minimal implementations of reinforcement learning algorithms by Tensorflow☆29Nov 29, 2017Updated 8 years ago
- Exploring the use of options in creating small worlds for faster learning in RL Domains☆16Jan 23, 2012Updated 14 years ago
- Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic …☆16Jan 22, 2019Updated 7 years ago
- Curatable database for experimental and theoretical data on solid materials.☆13Sep 21, 2025Updated 6 months ago
- ☆12Jul 29, 2022Updated 3 years ago
- ☆12Jan 13, 2025Updated last year
- Multi-Modal Mutual Information (MuMMI) Training for Robust Self-Supervised Deep Reinforcement Learning☆13Jun 28, 2022Updated 3 years ago
- a pytorch implementation of pensieve (https://github.com/hongzimao/pensieve)☆20Dec 24, 2019Updated 6 years ago
- ☆22Mar 18, 2017Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Interactive way to learn Algorithms . Feel free to contribute!☆24May 15, 2021Updated 4 years ago
- ☆12Nov 30, 2022Updated 3 years ago
- Coin collector game in Microsoft TextWorld, and a simple RL agent solving it.☆37Aug 6, 2021Updated 4 years ago
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 8 years ago
- Equivalent Linear Mappings of Large Language Models☆34Nov 7, 2025Updated 5 months ago
- ☆19Apr 25, 2017Updated 8 years ago
- working example of a contextual multi-armed bandit☆55Sep 3, 2019Updated 6 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Apr 30, 2021Updated 4 years ago
- My Solutions to Sutton and Barto exercises, 2nd edition☆14Apr 27, 2018Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 🦾Distributed Natural Evolution Strategies Build with PyTorch and Ray☆18Jul 20, 2018Updated 7 years ago
- performing sentiment analysis on the whatsapp chats.☆23Oct 17, 2017Updated 8 years ago
- A startup search engine made using embeddings built on crunchbase company descriptions☆11Dec 2, 2015Updated 10 years ago
- Neural model of hierarchical reinforcement learning☆16Sep 14, 2017Updated 8 years ago
- Code for the figures in Chapter 13 of "Reinforcement Learning: An Introduction" by Sutton and Barto☆14Jul 6, 2023Updated 2 years ago
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Oct 3, 2023Updated 2 years ago
- Generate text and predict next word for an initial piece of text using RNNs and LSTMs☆11Jun 27, 2017Updated 8 years ago