Collection of Deep Reinforcement Learning Jupyter Notebooks. Each notebook is self-contained and presents single algorithm. These include DP, MC, TD, SARSA, Q-Learning and DQNs.
☆42Mar 7, 2020Updated 6 years ago
Alternatives and similar repositories for rl-sketchpad
Users that are interested in rl-sketchpad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stochastic Markov Games☆12Oct 5, 2017Updated 8 years ago
- Applications of reinforcement learning to Groebner basis computation.☆14Jun 13, 2021Updated 4 years ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆22Dec 2, 2024Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Jan 2, 2025Updated last year
- In this repo, I used some math and image manipulation skills to create my own reinforcement learning environnement for autonomous car☆12Jun 24, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆20Aug 6, 2025Updated 10 months ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- This repository contains resources, documentation and artifacts describing LLM agents☆15Jan 22, 2025Updated last year
- A collection of various projects related to Reinforcement Learning☆19Feb 22, 2021Updated 5 years ago
- Code repository for the paper "Learning partial differential equations for biological transport models from noisy spatiotemporal data"☆10Jul 3, 2019Updated 6 years ago
- ☆20Feb 18, 2025Updated last year
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆13Aug 31, 2020Updated 5 years ago
- Code for the figures in Chapter 13 of "Reinforcement Learning: An Introduction" by Sutton and Barto☆14Jul 6, 2023Updated 2 years ago
- Oil Palm Tree Counting in Drone Image☆11Jun 22, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆15Aug 4, 2020Updated 5 years ago
- The Vanta Control Set maps common compliance standards from their requirements to controls and provides them in an easy to consume machin…☆20Aug 26, 2021Updated 4 years ago
- Regression in Convolutional Neural Network applied to Plant Leaf Count☆19Sep 6, 2022Updated 3 years ago
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.☆12Aug 15, 2020Updated 5 years ago
- Different implementations of Bayesian neural networks for uncertainty estimation. The uncertainty estimation is utilized for efficient ex…☆11Nov 29, 2020Updated 5 years ago
- Python + Numpy + Scipy Implementation of LARS and LASSO☆12Oct 19, 2010Updated 15 years ago
- Building reliable Retrieval Augmented Generation(RAG) AI Architecture☆13Jul 30, 2024Updated last year
- Solving CartPole-v1 environment in Keras with Actor Critic algorithm an Deep Reinforcement Learning algorithm☆12May 19, 2020Updated 6 years ago
- Reinforcement Learning framework to make synthetic experiments in the financial domain☆23Jul 18, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Empirical Analysis of EIP-1559: Transaction Fees, Waiting Time, and Consensus Security☆22Apr 25, 2023Updated 3 years ago
- Everything you need to know for data science.☆21Jan 10, 2023Updated 3 years ago
- Improving langchain knowledge graphs using baml☆45Aug 3, 2025Updated 10 months ago
- [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference☆28Jan 27, 2026Updated 4 months ago
- ☆11Aug 22, 2017Updated 8 years ago
- Hide the desktop icons for macOS☆12Mar 13, 2018Updated 8 years ago
- posenet+LSTM implementation with Keras& TensorFlow☆16Nov 28, 2019Updated 6 years ago
- Tutorial on NetworkX originally given at NetsciX 2016 School of Code☆15Jul 22, 2024Updated last year
- Neural Turing Machine for a Multi-Processor System on Chip verified with UVM/OSVVM/FV☆12May 29, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ZeroMat as presented at ICISCAE 2021☆12Jun 2, 2022Updated 4 years ago
- Unity Networking Library Benchmark on Bad Network Conditions☆17Sep 1, 2025Updated 9 months ago
- Generative Adversarial Network to create synthetic time series☆23Jul 22, 2020Updated 5 years ago
- ENet reliable UDP networking library modified to use Network Next☆16Jul 10, 2025Updated 10 months ago
- NSynth for the rest of us☆14May 12, 2017Updated 9 years ago
- projects about NLP knowledge graph, web crawling, word embedding, entity&relation extraction.☆13Dec 8, 2022Updated 3 years ago
- Tutorial and examples of Jackson APIs☆13Jul 21, 2019Updated 6 years ago