Repository of notes, code and notebooks in Python for the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto
☆37Aug 23, 2025Updated 6 months ago
Alternatives and similar repositories for reinforcement-learning
Users that are interested in reinforcement-learning are comparing it to the libraries listed below
Sorting:
- PyTorch helper module to translate to and from NIR☆17Jan 23, 2026Updated last month
- From Pytorch model to C++ for Vitis HLS☆20Feb 24, 2026Updated last week
- State-of-the-art architecture for Plant Disease Detection using Deep Learning.☆10Jul 4, 2022Updated 3 years ago
- JAxtar is a project with a JAX-native implementation of parallelizeable A* & Q* solver for neural heuristic search research.☆44Feb 21, 2026Updated last week
- Telegram bot for facilitating, accelerating, and automating the reservations at the University of Milan’s libraries, specifically the Bib…☆20Feb 12, 2026Updated 2 weeks ago
- ☆10Dec 19, 2019Updated 6 years ago
- Reference code for the paper ""Centroid-Guided Target-Driven Topology Control Method for UAV Ad-Hoc Networks Based on Tiny Deep Reinforce…☆10Oct 21, 2024Updated last year
- ☆13Feb 4, 2025Updated last year
- Jupyter notebook templates for processing and analyzing neuroscience data.☆13Dec 28, 2025Updated 2 months ago
- ☆12Oct 4, 2021Updated 4 years ago
- ☆10Jan 23, 2025Updated last year
- Distrax, but in equinox. Lightweight JAX library of probability distributions and bijectors.☆39Jan 16, 2026Updated last month
- Scaling safe exploration to vision control☆14Feb 19, 2025Updated last year
- Implementation of Diffusion Policy☆13Dec 13, 2024Updated last year
- Code for the paper "Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages" (N…☆17Apr 13, 2025Updated 10 months ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆11Aug 20, 2024Updated last year
- Official PyTorch implementation of POEM (Partial Observation Experts Modelling) as introduced in the paper Contrastive Meta-Learning for …☆12Nov 1, 2023Updated 2 years ago
- Adaptive Machine Learning-Based Stock Prediction using Financial Time Series Technical Indicators☆10Dec 21, 2019Updated 6 years ago
- RL for Energy Management of Microgrids☆10Mar 28, 2020Updated 5 years ago
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 6 months ago
- heterogeneous graph attention network for SMEs bankruptcy prediction☆12Feb 26, 2021Updated 5 years ago
- Official GitHub repository of the lecture "Multimodal Deep Learning for Recommendation", at the 2024 ACM RecSys Summer School☆12Oct 12, 2024Updated last year
- This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…☆11Feb 6, 2025Updated last year
- A small framework for benchmarking machine learning models.☆21Jun 6, 2025Updated 8 months ago
- A nascent Jax-based package for virtual brain modeling.☆12Feb 3, 2026Updated last month
- Tutorials for working with ADCIRC data and the CERA visualization software☆10May 15, 2024Updated last year
- A Middle Earth total conversion mod for Victoria II. It is still in the early stages of development☆14Feb 23, 2026Updated last week
- This is the official code for our paper entitled "Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning".☆10Aug 19, 2025Updated 6 months ago
- A clone of Shazam that I made independently for coursework, with a small dataset to prove the concept works.☆14Jul 11, 2023Updated 2 years ago
- We enable LLM with personalization capability☆11Nov 16, 2023Updated 2 years ago
- ☆13Oct 13, 2024Updated last year
- Hands-on tutorial about Meta RL and GP-MPC at the RL4AA'24 workshop.☆15May 10, 2024Updated last year
- Notes for EE364a - Convex Optimization I @ Stanford (will update Ch 6 - Ch 13 later)☆11Jul 30, 2019Updated 6 years ago
- JAX-DIPS is a differentiable interfacial PDE solver.☆50Sep 14, 2024Updated last year
- Scripts and datasets for mining Wi-Fi CSI mainly for breathing rate monitoring but it can be customized and extended for any other CSI-ba…☆13Mar 19, 2025Updated 11 months ago
- CS886: Graph Neural Networks☆12Mar 28, 2025Updated 11 months ago
- Official Pytorch Implementation of CMLO in the paper ”When to Update Your Model: Constrained Model-based Reinforcement Learning“☆10Nov 2, 2023Updated 2 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.☆16Nov 21, 2025Updated 3 months ago