Notes and solutions to exercises in Sutton and Barto's Reinforcement Learning textbook
☆50Jul 26, 2023Updated 2 years ago
Alternatives and similar repositories for sutton_and_barto
Users that are interested in sutton_and_barto are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Storing notes and smaller projects for the C++ Nanodegree program through Udacity☆14May 27, 2020Updated 5 years ago
- Solutions to Sutton and Barto book exercises☆132Mar 22, 2024Updated 2 years ago
- Genetic algorithm tuned through reinforcement learning☆17Jul 2, 2021Updated 4 years ago
- Implicit Differentiable Optimal Control (IDOC) with JAX☆12May 11, 2022Updated 3 years ago
- Solutions to exercises in Reinforcement Learning: An Introduction (2nd Edition).☆404Jul 24, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A WebSocket server implementation for running Model Context Protocol (MCP) servers. This application enables MCP servers to be accessed v…☆20Mar 17, 2025Updated last year
- Coursera-Fundamentals of Reinforcement Learning Specialization.☆15May 19, 2024Updated last year
- Simple bash script that downloads and installs ANTs in a unix environment☆13Jan 28, 2025Updated last year
- C++ Interfaces for the nAG Library☆18Jul 24, 2025Updated 8 months ago
- ☆13Feb 24, 2022Updated 4 years ago
- A collection of open-source projects on the Traffic Assignment Problem☆14Sep 11, 2023Updated 2 years ago
- Companion code to TRC paper: Daniel A. Lazar, Erdem Bıyık, Dorsa Sadigh, Ramtin Pedarsani. "Learning how to Dynamically Route Autonomous …☆16Aug 9, 2021Updated 4 years ago
- pipeline for volumetric cell segmentation☆14Feb 12, 2026Updated 2 months ago
- This repository contains code related to a research paper I've been working on titled "Dynamic traffic assignment with a node-based cell …☆18Aug 23, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- trust and specialty projections of waiting lists, for all trusts in england, updated each month☆10May 16, 2023Updated 2 years ago
- Notes and exercise solutions for second edition of Sutton & Barto's book☆405Oct 2, 2022Updated 3 years ago
- CS-541 Deep Learning is a graduate class that teaches both a theoretical and practical approach to deep learning. You will be able to see…☆10Aug 31, 2021Updated 4 years ago
- ☆12Sep 29, 2021Updated 4 years ago
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.☆14Feb 28, 2025Updated last year
- A rich and diverse dataset created with GPT-4 for training and evaluating conversational models in Hinglish☆15Aug 31, 2023Updated 2 years ago
- sumo + RL + Routing☆19May 6, 2022Updated 3 years ago
- A python implementation of the cell transmission model (CTM) for macroscopic traffic flow simulation.☆23Dec 6, 2021Updated 4 years ago
- ☆26Sep 4, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 日本の飲食店オープンデータ☆12May 6, 2021Updated 4 years ago
- Simplistic Implementation of Zipformer:A faster and better encoder for automatic speech recognition in PyTorch☆19Jun 3, 2024Updated last year
- SUMO chinese document translation project☆21Jun 5, 2021Updated 4 years ago
- 🛸 Developed PID controllers for controlling quadrotors in 1-D, 2-D, and 3-D control in MATLAB simulation environment.☆16Aug 29, 2021Updated 4 years ago
- Stochastic Logic Programs (SLP) style probabilistic logic programming in miniKanren☆34Feb 3, 2013Updated 13 years ago
- A markerless, low monetary cost, accessible approach to human gait analysis using an OpenPose-based 2D estimation system for knee flexion…☆34Feb 2, 2024Updated 2 years ago
- DimmWitted Gibbs Sampler in C++ — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👉🏿☆17Jan 23, 2017Updated 9 years ago
- ⭐ My own world.☆17Updated this week
- Probabilistic Graphical Models in Python3.☆24Jun 4, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Jun 28, 2022Updated 3 years ago
- Simple heat and power model of Germany☆12Jun 27, 2022Updated 3 years ago
- Code for NeurIPS 2024 paper "A SARS-CoV-2 Interaction Dataset and VHH Sequence Corpus for Antibody Language Models"☆15Oct 17, 2024Updated last year
- A Towers of Hanoi environment in OpenAI Gym Style☆14Jun 6, 2019Updated 6 years ago
- SUMO Tutorial, including net, route, traffic light, detector, sumolib and traci.☆23Aug 16, 2022Updated 3 years ago
- For building quantum neural networks in Qiskit and integrating with PyTorch☆19Dec 13, 2021Updated 4 years ago
- Beer Game implemented as an OpenAI gym environment.☆17Aug 4, 2019Updated 6 years ago