NewbieToEverything / Code-Mathmatical-Foundation-of-Reinforcement-Learning
This is the repository hosting the R scripts of the book "Mathematical Foundations of Reinforcement Learning" written by Yujun at Jiangxi Normal University.
☆21Updated last year
Alternatives and similar repositories for Code-Mathmatical-Foundation-of-Reinforcement-Learning:
Users who are interested in Code-Mathmatical-Foundation-of-Reinforcement-Learning are comparing it to the repositories listed below
- This is a reinforcement learning algorithm library. The code balances performance and simplicity, with few dependencies.☆98Updated 2 years ago
- Code for G2RL to solve the multi-robot path planning problem in a fully distributed reactive manner.☆61Updated last year
- A clean and robust PyTorch implementation of SAC for continuous action spaces☆70Updated 9 months ago
- Multi-UAV target round-up based on MADDPG☆125Updated last month
- Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning☆51Updated 3 weeks ago
- Code for "Autonomous navigation of UAV in multi-obstacle environments based on a Deep Reinforcement Learning approach"☆32Updated last year
- Multi/Single UAV (unmanned aerial vehicle) path planning based on deep reinforcement learning☆209Updated last week
- ☆56Updated 2 years ago
- Reinforcement Learning for quadrotor trajectory planning and control☆48Updated last year
- [RA-Letter 2022] "Reinforcement Learned Distributed Multi-Robot Navigation with Reciprocal Velocity Obstacle Shaped Rewards"☆217Updated 3 months ago
- PyTorch implementations of MADDPG, MAPPO (coming)☆129Updated last year
- Use the DQN, DDPG, PPO, and SAC algorithms on TurtleBot3 with PyTorch, and run the simulation in Gazebo.☆100Updated last year
- Use the Multi-agent Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm to find reasonable paths for ships☆61Updated 2 years ago
- ☆48Updated 2 years ago
- Adaptive Informative Path Planning Using Deep Reinforcement Learning for UAV-based Active Sensing☆98Updated 2 years ago
- This code is the result of the collaboration of RL Turkey team.☆28Updated last year
- Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning☆96Updated last year
- Study of reinforcement learning☆26Updated last year
- The mirror of RL_Coding_Exercise.☆80Updated 6 months ago
- Path planning based on PPO☆31Updated last year
- ☆129Updated 5 months ago
- Reproduction of a paper that uses improved Q-learning for path planning in a grid environment☆18Updated last year
- Reinforcement learning course project☆19Updated 9 months ago
- Multi-TurtleBot3 collision avoidance and navigation via DDPG-LSTM with Prioritized Experience Replay on ROS☆72Updated 2 years ago
- Deep Reinforcement Learning based Adaptive Real-time Path Planning for UAV☆263Updated 3 years ago
- ☆15Updated last year
- ☆40Updated 3 years ago
- Multi-agent Combat Arena (UAV swarm vs UAV swarm)☆122Updated 4 years ago
- Training code PRIMAL2 - Public Repo☆173Updated 10 months ago
- A PettingZoo (https://pettingzoo.farama.org/) environment for maritime Capture the Flag with uncrewed surface vehicles (USVs).☆15Updated this week