marcinbogdanski / rl-sketchpad
Collection of Deep Reinforcement Learning Jupyter Notebooks. Each notebook is self-contained and presents a single algorithm. These include DP, MC, TD, SARSA, Q-Learning and DQNs.
☆38 · Mar 7, 2020 · Updated 5 years ago
Alternatives and similar repositories for rl-sketchpad
Users who are interested in rl-sketchpad are comparing it to the libraries listed below.
- Multi-Agent LLM System for Digital Scam Protection ☆12 · Dec 19, 2024 · Updated last year
- Image Search Engine with HuggingFace Sentence Transformer ☆12 · Aug 31, 2023 · Updated 2 years ago
- Reference code base for ML Engineering in Action, Manning Publications. Author: Ben Wilson ☆20 · Oct 22, 2023 · Updated 2 years ago
- IBM Quantum Challenge Fall 2023 ☆10 · May 23, 2023 · Updated 2 years ago
- This course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust… ☆14 · Feb 13, 2024 · Updated 2 years ago
- Building reliable Retrieval Augmented Generation (RAG) AI Architecture ☆13 · Jul 30, 2024 · Updated last year
- This repository contains resources, documentation and artifacts describing LLM agents ☆14 · Jan 22, 2025 · Updated last year
- Applications of reinforcement learning to Groebner basis computation. ☆15 · Jun 13, 2021 · Updated 4 years ago
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training. ☆12 · Aug 15, 2020 · Updated 5 years ago
- Different implementations of Bayesian neural networks for uncertainty estimation. The uncertainty estimation is utilized for efficient ex… ☆10 · Nov 29, 2020 · Updated 5 years ago
- ☆20 · Feb 18, 2025 · Updated 11 months ago
- ☆14 · Apr 22, 2024 · Updated last year
- Everything you need to know for data science. ☆21 · Jan 10, 2023 · Updated 3 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning ☆12 · Aug 31, 2020 · Updated 5 years ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications. ☆21 · Dec 2, 2024 · Updated last year
- Official implementation of the paper How to Listen? Rethinking Visual Sound Localization ☆17 · Apr 25, 2022 · Updated 3 years ago
- MLflow is an open source platform for the machine learning lifecycle; here you can learn MLflow through an end-to-end example with prediction. ☆13 · Jun 14, 2022 · Updated 3 years ago
- Lab files of IBM's Qiskit Global Summer School 2020. ☆17 · Sep 3, 2020 · Updated 5 years ago
- Improving langchain knowledge graphs using baml ☆43 · Aug 3, 2025 · Updated 6 months ago
- Contextual Bandits Action Elimination DQN ☆21 · Jun 25, 2018 · Updated 7 years ago
- Tensorflow implementation for "Noisy network for exploration" ☆19 · Aug 2, 2017 · Updated 8 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices ☆23 · Jun 22, 2021 · Updated 4 years ago
- This project is focused on the deployment phase of machine learning. Docker and FastAPI are used to deploy a dockerized server of tra… ☆27 · Jan 7, 2023 · Updated 3 years ago
- Time Series Prediction with LSTM Recurrent Neural Networks in Python with Keras, based on a tutorial by Jason Brownlee ☆21 · Jul 23, 2018 · Updated 7 years ago
- 🤓 A collection of AWESOME structured summaries of Large Language Models (LLMs) ☆31 · Sep 7, 2023 · Updated 2 years ago
- BigOBench assesses the capacity of Large Language Models (LLMs) to comprehend time-space computational complexity of input or generated c… ☆40 · Apr 15, 2025 · Updated 10 months ago
- Deep Q Network implemented in TensorFlow ☆25 · Mar 9, 2018 · Updated 7 years ago
- Solutions for different Reinforcement Learning environments ☆26 · Aug 2, 2024 · Updated last year
- Stock market data can be interesting to analyze and as a further incentive, strong predictive models can have large financial payoff. The… ☆25 · May 2, 2018 · Updated 7 years ago
- Learning TensorFlow Step by Step: Concepts, Examples & Applications ☆56 · Jun 18, 2025 · Updated 7 months ago
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks. ☆10 · Sep 3, 2021 · Updated 4 years ago
- A walk through HuggingFace smolagents ☆48 · Mar 7, 2025 · Updated 11 months ago
- Implementation of HEFT (Heterogeneous Earliest Finish Time) DAG Scheduling Algorithm in Python ☆33 · Dec 17, 2022 · Updated 3 years ago
- Experiments to train transformer network to master reinforcement learning environments. ☆32 · Mar 14, 2021 · Updated 4 years ago
- Source code for Reinforcement Learning in Scala talk ☆29 · Nov 27, 2018 · Updated 7 years ago
- LSTM for time series forecasting ☆28 · Nov 12, 2017 · Updated 8 years ago
- Model-based Policy Gradients ☆32 · Mar 12, 2020 · Updated 5 years ago
- Implementation of Dynamic Computation Offloading Control Logic in a Software-Defined Vehicle (SDV) System ☆11 · Dec 19, 2024 · Updated last year
- Intelligent Document Processing with AWS AI/ML, published by Packt ☆10 · Feb 5, 2026 · Updated last week