Notes and solutions to exercises in Sutton and Barto's Reinforcement Learning textbook
☆50Jul 26, 2023Updated 2 years ago
Alternatives and similar repositories for sutton_and_barto
Users that are interested in sutton_and_barto are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Solutions to exercises in Reinforcement Learning: An Introduction (2nd Edition).☆404Jul 24, 2023Updated 2 years ago
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".☆24Nov 8, 2024Updated last year
- Solutions of Reinforcement Learning, An Introduction☆2,397Jul 10, 2025Updated 9 months ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆35Jun 28, 2024Updated last year
- ☆23Nov 30, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- trust and specialty projections of waiting lists, for all trusts in england, updated each month☆10May 16, 2023Updated 2 years ago
- Notes and exercise solutions for second edition of Sutton & Barto's book☆405Oct 2, 2022Updated 3 years ago
- ☆20Oct 24, 2022Updated 3 years ago
- This repository contains the notebooks of the series 'transformers by doing - leaving no rock unturned'☆13Sep 24, 2023Updated 2 years ago
- ☆12Sep 29, 2021Updated 4 years ago
- This is a project extending the solution to the kaggle-connectx problem statement. Here I have made the frontend UI for the same and adde…☆10Mar 8, 2021Updated 5 years ago
- A rich and diverse dataset created with GPT-4 for training and evaluating conversational models in Hinglish☆15Aug 31, 2023Updated 2 years ago
- A python library which simplifies creating and exporting videos.☆11Oct 1, 2023Updated 2 years ago
- Wrapper for DeviantArt API with typings☆10Mar 15, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking☆17Oct 26, 2023Updated 2 years ago
- DimmWitted Gibbs Sampler in C++ — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👉🏿☆17Jan 23, 2017Updated 9 years ago
- ☆12Jun 15, 2023Updated 2 years ago
- Course repo for Advanced Machine Learning Course at Linköping University☆17Oct 28, 2025Updated 5 months ago
- The official source code and datasets for the paper titled "Evaluating ChatGPT as a Recommender System: A Rigorous Approach"☆14Apr 24, 2024Updated last year
- ⭐ My own world.☆17Updated this week
- ☆11Jun 28, 2022Updated 3 years ago
- MinHash implementation in Python☆12Aug 24, 2024Updated last year
- A Towers of Hanoi environment in OpenAI Gym Style☆14Jun 6, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- My personal practice to implement algorithms of RL from scratch.☆38May 18, 2020Updated 5 years ago
- ☆102May 10, 2020Updated 5 years ago
- Beer Game implemented as an OpenAI gym environment.☆17Aug 4, 2019Updated 6 years ago
- Code for the benchmark containing dataset, models and metrics for productive concept learning -- a kind of compositional reasoning task t…☆17Jul 22, 2021Updated 4 years ago
- Visualize your Makefile using GraphViz dot utility☆11Jan 20, 2025Updated last year
- An environment for tabular Reinforcement Learning agents.☆14Jun 13, 2018Updated 7 years ago
- ☆13Mar 22, 2023Updated 3 years ago
- [IROS2020] Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas☆10Mar 25, 2023Updated 3 years ago
- Scalable Neural-Probabilistic Answer Set Programming☆18May 23, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code implementation of R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning☆22Jul 8, 2024Updated last year
- Unofficial website of security camp☆14Aug 19, 2025Updated 7 months ago
- A Keras-based recommendation engine for subreddits, channels on the popular social media site Reddit☆10Feb 24, 2024Updated 2 years ago
- A novel approach to learning concept embeddings and approximate reasoning in ALC knowledge bases with neural networks☆14Feb 7, 2023Updated 3 years ago
- The Pix2Code framework: generalizable, interpretable and revisable visual concept learning☆14Oct 7, 2025Updated 6 months ago
- Open AI Gym for ConnectFour game☆17Sep 21, 2022Updated 3 years ago
- Inference Llama 2 in one file of pure Java☆19Nov 13, 2023Updated 2 years ago