A collection of deep reinforcement learning Jupyter notebooks. Each notebook is self-contained and presents a single algorithm. The algorithms covered include dynamic programming (DP), Monte Carlo (MC), temporal-difference (TD) learning, SARSA, Q-learning, and DQNs.
☆38Mar 7, 2020Updated 6 years ago
Alternatives and similar repositories for rl-sketchpad
Users who are interested in rl-sketchpad are comparing it to the repositories listed below
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated last year
- This course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust…☆14Feb 13, 2024Updated 2 years ago
- Applications of reinforcement learning to Groebner basis computation.☆14Jun 13, 2021Updated 4 years ago
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.☆12Aug 15, 2020Updated 5 years ago
- Building reliable Retrieval Augmented Generation (RAG) AI Architecture☆13Jul 30, 2024Updated last year
- Different implementations of Bayesian neural networks for uncertainty estimation. The uncertainty estimation is utilized for efficient ex…☆10Nov 29, 2020Updated 5 years ago
- This repository contains resources, documentation and artifacts describing LLM agents☆15Jan 22, 2025Updated last year
- ☆14Apr 22, 2024Updated last year
- ☆20Feb 18, 2025Updated last year
- Workshop for Model Context Protocol☆17Mar 27, 2025Updated 11 months ago
- Everything you need to know for data science.☆21Jan 10, 2023Updated 3 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆12Aug 31, 2020Updated 5 years ago
- Official implementation of the paper How to Listen? Rethinking Visual Sound Localization☆18Apr 25, 2022Updated 3 years ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆21Dec 2, 2024Updated last year
- [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference☆26Jan 27, 2026Updated last month
- Lab files of IBM's Qiskit Global Summer School 2020.☆17Sep 3, 2020Updated 5 years ago
- MLflow is an open-source platform for the machine learning lifecycle. Here you can learn MLflow through an end-to-end example with prediction.☆13Jun 14, 2022Updated 3 years ago
- Improving langchain knowledge graphs using baml☆42Aug 3, 2025Updated 7 months ago
- NLP/LLM Mlops Pipeline to dev/train/evaluation, scalable deploy and monitoring systems.☆22Mar 15, 2024Updated last year
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago
- Regression in Convolutional Neural Network applied to Plant Leaf Count☆19Sep 6, 2022Updated 3 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- A collection of various projects related to Reinforcement Learning☆19Feb 22, 2021Updated 5 years ago
- This project is focused on the Deployment phase of machine learning. The Docker and FastAPI are used to deploy a dockerized server of tra…☆27Jan 7, 2023Updated 3 years ago
- ☆31Jul 18, 2024Updated last year
- 🤓 A collection of AWESOME structured summaries of Large Language Models (LLMs)☆31Sep 7, 2023Updated 2 years ago
- Time Series Prediction with LSTM Recurrent Neural Networks in Python with Keras, based on a tutorial by Jason Brownlee☆21Jul 23, 2018Updated 7 years ago
- AI Agents with Google's Gemini Pro and Gemini Pro Vision Models☆28Jan 19, 2024Updated 2 years ago
- BigOBench assesses the capacity of Large Language Models (LLMs) to comprehend time-space computational complexity of input or generated c…☆40Apr 15, 2025Updated 10 months ago
- Deep Q Network implemented in TensorFlow☆25Mar 9, 2018Updated 8 years ago
- Stock market data can be interesting to analyze and as a further incentive, strong predictive models can have large financial payoff. The…☆26May 2, 2018Updated 7 years ago
- Solutions for different Reinforcement Learning environments☆26Aug 2, 2024Updated last year
- Learning TensorFlow Step by Step: Concepts, Examples & Applications☆56Jun 18, 2025Updated 8 months ago
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks.☆10Sep 3, 2021Updated 4 years ago
- A walk through HuggingFace smolagents☆49Mar 7, 2025Updated last year
- Source code for Reinforcement Learning in Scala talk☆29Nov 27, 2018Updated 7 years ago
- Code accompanying the paper: Elena Ricciardelli, Debmalya Biswas. Self-improving Chatbots based on Reinforcement Learning. In proceedings…☆24Jan 2, 2022Updated 4 years ago