This repository is a playground for beginners to learn reinforcement learning. It is a collection of simple environments and agents to get you started with reinforcement learning.
☆25Jul 30, 2024Updated last year
Alternatives and similar repositories for RL_PlayGround
Users that are interested in RL_PlayGround are comparing it to the libraries listed below
Sorting:
- This repository contains data and code related to TSG paper "Production Scheduling Identification: An Inverse Optimization Approach for I…☆14Apr 14, 2025Updated 10 months ago
- jw converter: 将某校教务平台的课表转换为 ICS 文件☆13Oct 26, 2025Updated 4 months ago
- 智能控制结课作业实验代码实现部分,包括模糊控制器和PID控制器实现以及控制器参数优化整定,PID参数采用Nelder-Mead优化,模糊控制器参数采用遗传算法优化。☆10Dec 2, 2024Updated last year
- This set of codes implements our TSG paper "Hierarchical Deep Learning Model for Degradation Prediction per Look-Ahead Scheduled Battery …☆11Feb 24, 2025Updated last year
- Keras 1D Depthwise Convolutional layer☆10May 22, 2020Updated 5 years ago
- A full-replica MATLAB/Simulink dynamic model of the IEEE 39-bus power system, including dynamic models of conventional generation and dyn…☆11Jun 18, 2018Updated 7 years ago
- VG-TechCenter / 001-Optimal-operation-considering-demand-response-under-the-carbon-trading-mechanism001-碳交易机制下考虑需求响应的优化运行☆14Oct 15, 2023Updated 2 years ago
- Exploration of techniques to solve tasks with a Panda robotic arm. Simulation based on PyBullet physics engine and gymnasium.☆10Mar 17, 2025Updated 11 months ago
- This repository is the demo implementation of [Deep Dimension Reduction for Supervised Representation Learning].☆11Sep 12, 2024Updated last year
- A MAPF Algorithm Visualizer☆11Mar 2, 2025Updated last year
- [AAMAS 2024] HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding☆13Mar 12, 2024Updated last year
- ☆12Mar 15, 2023Updated 2 years ago
- DC Optimal Power Flow (OPF) via gurobi and yalmip, respectively. An example for learning gurobi and yalmip. 分别通过gurobi和yalmip实现直流最优潮流。学习g…☆12Jul 29, 2025Updated 7 months ago
- RL_Dynamic_Network_Reconfiguration☆10Apr 13, 2023Updated 2 years ago
- The following code is a mixed integer linear programming (MILP) optimisation for an multi period economic load dispatch problem. It is im…☆14Feb 15, 2022Updated 4 years ago
- 用koch复现lerobot—遥操作数据采集—act复现—diffusion model复 现—Pi模型复现—视觉大模型☆26May 16, 2025Updated 9 months ago
- Mobile Manipulator control package for beverage serving automation☆11Dec 1, 2019Updated 6 years ago
- A repository to introduce the algorithmic information theory. You could learn what is Kolmogorov complexity and why it is important here.☆13Jul 23, 2025Updated 7 months ago
- 一般印象,flask 项目适合做一些短小精悍的项目,特别是与 sqlite、mysql 等 数据库结合很是般配。但是在一些大公司,特别是一些金融行业等国企公司,还是以 oracle 居多,那么,这个小辣椒(flask)就无用武之地了吗?No, No, No... 下面将以 …☆11Jan 26, 2018Updated 8 years ago
- 2025华为杯研究生数学建模比赛☆22Sep 20, 2025Updated 5 months ago
- ☆11Feb 17, 2025Updated last year
- Change the login background image of Gnome Display Manager.☆10May 9, 2020Updated 5 years ago
- BRUCE_simulation_models☆12May 27, 2024Updated last year
- RLC4CLR employs curriculum learning to train a reinforcement learning controller (RLC) for a distribution system critical load restoratio…☆18Jul 3, 2025Updated 8 months ago
- Obstacle avoidance in UAVs with reinforcement learning (PPO)☆11Oct 6, 2022Updated 3 years ago
- 基于2016年电工杯数学建模竞赛数据集建立的超短期以及短期负荷预测☆17May 4, 2024Updated last year
- TS_SPMA: The Tabu Search algorithm for simultaneous scheduling problem of machines and AGVs.☆12Apr 30, 2021Updated 4 years ago
- Robotiq Gripper☆12Mar 9, 2020Updated 6 years ago
- Path Planning Basic Algorithm☆11Oct 11, 2021Updated 4 years ago
- ☆13Aug 17, 2020Updated 5 years ago
- Multivariate stochastic modeling for transcriptional dynamics with cell-specific latent time using SDEvelo☆19Jan 28, 2025Updated last year
- This repository contains various Transmission and Generation related optimization problem. The problems are solved using YALMIP toolbox w…☆14Jan 29, 2021Updated 5 years ago
- Neural Networks package for R with a fast C++ back-end and special support for unsupervised anomaly detection using autoencoders☆12Oct 9, 2025Updated 5 months ago
- Learning how to ride a bicycle using reinforcement learning.☆13Dec 11, 2013Updated 12 years ago
- ☆12Nov 11, 2019Updated 6 years ago
- Human - Robot Collaboration for fabric folding using Kinect2, RoboDK, Reflex 1 gripper and the ATI Force Torque Gamma sensor☆15Mar 1, 2023Updated 3 years ago
- MPNG☆11Sep 13, 2023Updated 2 years ago
- ☆10Dec 28, 2018Updated 7 years ago
- AGV system simulator for Reinforced Learning☆17Jul 12, 2022Updated 3 years ago