chuiboy / GridWorld_-Planning-RL-
Implements policy iteration and value iteration (planning) as well as Q-learning (RL), applied to the GridWorld problem, where the agent's goal is to take the quickest path to the terminal state.
☆12May 19, 2018Updated 7 years ago
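For readers unfamiliar with the planning side, value iteration on a grid can be sketched as follows. This is a minimal illustration and not code from the repository: the grid size, the reward of -1 per step, the deterministic moves, and the names `step`, `value_iteration`, and `greedy_policy` are all assumptions chosen for the example.

```python
from typing import Dict, Tuple

State = Tuple[int, int]

# Hypothetical action set: (row delta, column delta) for each move.
ACTIONS: Dict[str, Tuple[int, int]] = {
    "up": (-1, 0),
    "down": (1, 0),
    "left": (0, -1),
    "right": (0, 1),
}


def step(state: State, action: str, rows: int, cols: int) -> State:
    """Deterministic move; bumping into a wall leaves the agent in place."""
    dr, dc = ACTIONS[action]
    r, c = state[0] + dr, state[1] + dc
    if 0 <= r < rows and 0 <= c < cols:
        return (r, c)
    return state


def value_iteration(
    rows: int,
    cols: int,
    terminal: State,
    gamma: float = 1.0,
    step_reward: float = -1.0,
    tol: float = 1e-9,
) -> Dict[State, float]:
    """Sweep the Bellman optimality update until the largest change is below tol."""
    values = {(r, c): 0.0 for r in range(rows) for c in range(cols)}
    while True:
        delta = 0.0
        for s in values:
            if s == terminal:
                continue  # the terminal state keeps value 0
            best = max(
                step_reward + gamma * values[step(s, a, rows, cols)]
                for a in ACTIONS
            )
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            return values


def greedy_policy(
    values: Dict[State, float], rows: int, cols: int, terminal: State
) -> Dict[State, str]:
    """Pick, in every non-terminal state, an action leading to the best next state."""
    return {
        s: max(ACTIONS, key=lambda a: values[step(s, a, rows, cols)])
        for s in values
        if s != terminal
    }


values = value_iteration(rows=4, cols=4, terminal=(3, 3))
policy = greedy_policy(values, rows=4, cols=4, terminal=(3, 3))
```

With a reward of -1 per step and no discounting, the converged value of a state is the negative of its shortest-path length to the terminal state, so following the greedy policy yields a quickest path. Q-learning pursues the same goal but estimates action values from sampled transitions instead of sweeping a known model.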
Alternatives and similar repositories for GridWorld_-Planning-RL-
Users that are interested in GridWorld_-Planning-RL- are comparing it to the libraries listed below
- Value iteration, policy iteration, and Q-Learning in a grid-world MDP.☆28Dec 12, 2023Updated 2 years ago
- Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid☆10Jan 3, 2023Updated 3 years ago
- MATLAB code and data for the article 📋: I. Daminov, A. Prokhorov, R. Caire, M-C Alvarez-Herault, “Assessment of dynamic transformer rati…☆13Jun 23, 2024Updated last year
- Not a systems-programming coursework but trouble with my head, help☆15Dec 5, 2025Updated 2 months ago
- Machine learning project for forecasting energy demand in a microgrid in Bristol☆10May 12, 2021Updated 4 years ago
- My implementation of socket.io with websocket transport for use with Bun + Hono. API similar to Socket.io but this is another library. Fu…☆16Aug 27, 2025Updated 5 months ago
- Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation"☆10Mar 27, 2018Updated 7 years ago
- Compare Q-Learning and Expected Value SARSA.☆11Oct 7, 2018Updated 7 years ago
- I-JEPA finetuning recipe☆12Jul 11, 2024Updated last year
- Simple demo for Databricks!☆14Sep 11, 2023Updated 2 years ago
- ☆14Sep 24, 2022Updated 3 years ago
- AI learns to drive a car using the NeuroEvolution algorithm using self-written OpenGL based 3D engine.☆15Jan 7, 2021Updated 5 years ago
- ☆14Jun 3, 2025Updated 8 months ago
- Fine-tune GPT2 to generate fake job experiences☆11Jan 17, 2023Updated 3 years ago
- Comwatt Integration for HomeAssistant☆22Jan 23, 2026Updated 3 weeks ago
- Testing different RL algorithms for multi-agent environments. From SARSA, QLearning to Independent Q-Learning, Joint Action Learning and …☆12Mar 29, 2019Updated 6 years ago
- notes for community call for ZKML Community☆11Jan 18, 2023Updated 3 years ago
- ☆18Jun 18, 2024Updated last year
- Predictive Maintenance avoids the drawbacks of Preventive Maintenance (under utilization of a part's life) and Reactive Maintenance (unsc…☆17Apr 25, 2022Updated 3 years ago
- ☆18Oct 13, 2022Updated 3 years ago
- ☆18Feb 10, 2023Updated 3 years ago
- Multi Agent SAC and DDPG applied to path finding in a 3-dimensional grid☆15Aug 8, 2021Updated 4 years ago
- In electrical distribution systems, a great amount of power is wasted across the lines; also, nowadays power factors, voltage profiles a…☆15Nov 11, 2018Updated 7 years ago
- Online Informative Path Planning for Active Information Gathering of a 3D Surface☆14Dec 20, 2021Updated 4 years ago
- fargo is a Farcaster CLI written in Go.☆20Dec 20, 2025Updated last month
- My replication code for the paper Pointer Networks.☆20Aug 29, 2022Updated 3 years ago
- Conversion tool for various grid data formats to power-grid-model☆23Updated this week
- A robotic arm demo, including high-tech technologies such as cloud, edge, mobile and image/speech recognition...☆16Dec 13, 2022Updated 3 years ago
- Implementations of various RL and Deep RL algorithms in TensorFlow, PyTorch and Keras.☆16Sep 18, 2024Updated last year
- Training Agents in a cooperative multi-agent deep reinforcement learning setting to transport objects across a space☆14Jul 5, 2021Updated 4 years ago
- Distributed RL Algorithms for Dynamic Energy Pricing in Microgrids☆18Jun 7, 2021Updated 4 years ago
- Frames.fun browser extension that implements open frames☆20Nov 21, 2024Updated last year
- ☆19May 15, 2024Updated last year
- Foundry is a blazing fast, portable and modular toolkit for Ethereum application development written in Rust.☆15Oct 25, 2023Updated 2 years ago
- Open rotating mechanical fault datasets (开源旋转机械故障数据集整理)☆22Aug 10, 2020Updated 5 years ago
- Implementations of DQN, DQN with PER, DDQN with PER, and DDDQN with PER agents to maximise reward in a 14 node power grid station☆20Aug 22, 2020Updated 5 years ago
- Generate reliably random numbers using Foundry's FFI cheat code.☆22Dec 23, 2022Updated 3 years ago
- A simple template for making Farcaster/Warpcast frames using ExpressJS☆19Jan 29, 2024Updated 2 years ago
- ChannelKit is an open-source Next.js application built with Frog.js and deployed on Vercel, providing a customizable Farcaster Frame for …☆22Oct 15, 2024Updated last year