Repository of notes, code and notebooks in Python for the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto
☆37Mar 6, 2026Updated 2 weeks ago
Alternatives and similar repositories for reinforcement-learning
Users who are interested in reinforcement-learning are comparing it to the libraries listed below.
- From PyTorch model to C++ for Vitis HLS☆20Updated this week
- Repository for the 2024 Telluride Topic Area Neuromorphic Systems for Space☆13Jun 21, 2024Updated last year
- Paper published in the Journal of Investment Management, co-authored with Sanjiv R. Das☆13Oct 4, 2017Updated 8 years ago
- Score-based Diffusion models in JAX.☆18Dec 29, 2025Updated 2 months ago
- ☆10Dec 19, 2019Updated 6 years ago
- We open-source our layout level fast EM simulation tool, EMSim, to the public.☆14Feb 8, 2024Updated 2 years ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆12Sep 16, 2024Updated last year
- Ideas of Data Science, ML, DL capstone projects to try out.☆14Feb 26, 2020Updated 6 years ago
- ESP32S3 AI voice assistant is a voice interaction system based on ESP32S3, implemented with Arduino IDE.☆12Aug 26, 2024Updated last year
- ☆14Nov 9, 2013Updated 12 years ago
- Zhejiang University course guide sharing project☆12Jul 23, 2021Updated 4 years ago
- Format your bibtex (.bib) file to help standardize citations for conference and journal submissions☆14Nov 23, 2025Updated 4 months ago
- A Deep-Reinforcement-Learning-Based Scheduler for FPGA HLS☆15Feb 27, 2021Updated 5 years ago
- End-to-End Autonomous Driving with Spiking Neural Networks☆91Jan 13, 2025Updated last year
- RDF-to-text generator using GANs and reinforcement learning, for Google Summer of Code 2020.☆14Mar 25, 2023Updated 2 years ago
- Seminars for OS course☆11Dec 1, 2023Updated 2 years ago
- Polyphonic Sound Detection Score (PSDS)☆16Jan 20, 2020Updated 6 years ago
- ☆13Mar 16, 2025Updated last year
- ☆15Dec 6, 2017Updated 8 years ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.☆16Nov 21, 2025Updated 4 months ago
- Implementation of Proximal Policy Optimization (PPO) for continuous action space (`Pendulum-v1` from gym) using tensorflow2.x and pytorch…☆10Aug 8, 2022Updated 3 years ago
- FMCW LiDAR implementation in CARLA simulator☆18Mar 18, 2024Updated 2 years ago
- Recent papers on Graph Neural Networks-based Recommender System.☆12Aug 21, 2023Updated 2 years ago
- ☆13May 3, 2017Updated 8 years ago
- Using reinforcement learning to make markets in the high frequency trading setting.☆27Apr 8, 2025Updated 11 months ago
- A clone of Shazam that I made independently for coursework, with a small dataset to prove the concept works.☆14Jul 11, 2023Updated 2 years ago
- Learning ReLU INRs with B-spline wavelets.☆14Jun 5, 2024Updated last year
- Exploring HMM, LSTM and Regression techniques to predict respiratory rate of an individual from accelerometer data.☆15Dec 4, 2018Updated 7 years ago
- ☆15Oct 21, 2023Updated 2 years ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆16Nov 22, 2025Updated 4 months ago
- A simple tutorial to add medical reasoning using GRPO☆20Feb 10, 2025Updated last year
- ☆37Feb 4, 2026Updated last month
- ☆11Apr 28, 2024Updated last year
- Summer School Week 1 & 2 repo☆12Jul 1, 2022Updated 3 years ago
- A comprehensive time-series dataset survey☆20Aug 1, 2022Updated 3 years ago
- A Deep Reinforcement Learning model for high volume and frequency Forex Portfolio Management☆13Jan 11, 2023Updated 3 years ago
- Quantitative Derivatives Models☆16Aug 28, 2024Updated last year
- ☆10Jan 23, 2025Updated last year
- Metrics for spiking neural networks based on torchmetrics☆13Mar 27, 2023Updated 2 years ago