Collection of Deep Reinforcement Learning Jupyter Notebooks. Each notebook is self-contained and presents single algorithm. These include DP, MC, TD, SARSA, Q-Learning and DQNs.
☆42Mar 7, 2020Updated 6 years ago
Alternatives and similar repositories for rl-sketchpad
Users that are interested in rl-sketchpad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reference code base for ML Engineering in Action, Manning Publications Author: Ben Wilson☆21Oct 22, 2023Updated 2 years ago
- Official implementation of the paper How to Listen? Rethinking Visual Sound Localization☆18Apr 25, 2022Updated 4 years ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Jan 2, 2025Updated last year
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- Save and load entire workspaces containins pandas objects and numpy arrays☆15Oct 5, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- Unofficial PyBrain extension for multi-agent reinforcement learning in general sum stochastic games.☆69Jul 17, 2025Updated 9 months ago
- This repository contains resources, documentation and artifacts describing LLM agents☆15Jan 22, 2025Updated last year
- In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust…☆14Feb 13, 2024Updated 2 years ago
- Never fill a sockaddr_in struct by hand again!☆13Apr 10, 2020Updated 6 years ago
- ☆20Feb 18, 2025Updated last year
- MLflow is Open source platform for the machine learning lifecycle so here you can learn MLflow End to End Example with Prediction.☆13Jun 14, 2022Updated 3 years ago
- lazy_dataset: Process large datasets as if it was an iterable.☆18Dec 1, 2025Updated 4 months ago
- [NeurIPS 2024 Spotlight] code for "Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement"☆20Jan 26, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Tutorial covering Open Source tools for Source Separation.☆15Nov 12, 2021Updated 4 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.☆12Aug 15, 2020Updated 5 years ago
- Different implementations of Bayesian neural networks for uncertainty estimation. The uncertainty estimation is utilized for efficient ex…☆11Nov 29, 2020Updated 5 years ago
- Reinforcement Learning framework to make synthetic experiments in the financial domain☆23Jul 18, 2023Updated 2 years ago
- Improving langchain knowledge graphs using baml☆43Aug 3, 2025Updated 8 months ago
- ☆12Nov 9, 2020Updated 5 years ago
- [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference☆26Jan 27, 2026Updated 3 months ago
- ☆31Jul 18, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Neural Turing Machine for a Multi-Processor System on Chip verified with UVM/OSVVM/FV☆12Apr 20, 2026Updated last week
- ZeroMat as presented at ICISCAE 2021☆12Jun 2, 2022Updated 3 years ago
- An rigorous, well documented machine learning analysis pipeline for binary classification datasets assembled as a Jupyter Notebook. Inclu…☆11Sep 1, 2020Updated 5 years ago
- Unity Networking Library Benchmark on Bad Network Conditions☆17Sep 1, 2025Updated 7 months ago
- Generative Adversarial Network to create synthetic time series☆23Jul 22, 2020Updated 5 years ago
- ENet reliable UDP networking library modified to use Network Next☆16Jul 10, 2025Updated 9 months ago
- Official codebase for our NeurIPS paper, Symmetry-Informed Governing Equation Discovery.☆11Nov 13, 2024Updated last year
- ☆11Aug 5, 2022Updated 3 years ago
- NSynth for the rest of us☆14May 12, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Experiments to train transformer network to master reinforcement learning environments.☆32Mar 14, 2021Updated 5 years ago
- React.js Babylon.js WebGL project☆10Jan 25, 2022Updated 4 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆21Aug 26, 2022Updated 3 years ago
- This project is focused on the Deployment phase of machine learning. The Docker and FastAPI are used to deploy a dockerized server of tra…☆27Jan 7, 2023Updated 3 years ago
- 图像处理接口:图像解模糊(deblurring)和图像超分辨率还原(Super-resolution)深度学习框架tensorflow和torch,并实现web后端基于python-Flask框架的接口,python语言☆13Sep 25, 2019Updated 6 years ago
- Simulated Model Predictive Controller (MPC) for an inverted pendulum on a cart in Python☆17Nov 4, 2020Updated 5 years ago
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago