saaries / Maze_DQN_reinforcement_learning
Use deep Q network to solve maze problem generated randomly, i.e. find the shortest path in a maze
☆8Updated last year
Related projects: ⓘ
- Deep reinforcement learning (DQN) implements robot path planning☆13Updated 3 years ago
- Use Multi-Agent Deep Deterministic Policy Gradient(DDPG) algorithm to find reasonable paths for ships☆26Updated last year
- created an environment of 10*10 grid and 4 UAVs to carry out coverage path planning cooperatively☆67Updated 2 years ago
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆43Updated last year
- Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning☆63Updated last year
- ☆77Updated 7 years ago
- ☆35Updated 2 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆89Updated 2 years ago
- Multi-agent RL algorithms are applied to large-scale cooperative target searching mission☆24Updated last year
- Deep reinforcement learning☆27Updated 2 years ago
- ☆15Updated 3 years ago
- 基于ppo的路径规划☆15Updated last year
- This is the source code of "Efficient training techniques for multi-agent reinforcement learning in combatant tasks".☆62Updated 3 years ago
- Python implementation of DDQN multi-UAV data harvesting☆145Updated 2 years ago
- Path Planning for the Robots Using Reinforcement Learning☆29Updated last year
- 路径规划强化学习☆39Updated 4 years ago
- Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)☆13Updated 4 years ago
- 无人机动态覆盖控制;1. 实现了一个无人机点覆盖环境;2. 给出了无人机连通保持规则;3. 给出了基于MARL的控制算法☆33Updated 3 weeks ago
- ☆10Updated 2 years ago
- ☆24Updated last year
- Deep Reinforcement Learning (RL) algorithms for underwater target tracking with Autonomous Underwater Vehicles (AUV)☆68Updated 2 years ago
- multi-turtlebot3 collision avoidance and navigation via DDPG-LSTM with Prioritized Experience Replay on ROS☆63Updated 2 years ago
- A leader-follower formation control using deep reinforcement learning environment, In which every agent can learn to follow the leader ag…☆33Updated last year
- Code for implementing/applying ODM*, PPO, MAAC, IC3Net and PRIMAL (PPO version) on a Multi-Agent gridworld environment.☆28Updated 3 years ago
- ☆11Updated 3 years ago
- 利用深度强化学习的方法实现多智能体间离散无交流的障碍避免。其中强化学习算法训练模型所需的数据集由最优互惠碰撞避免(Optimal Reciprocal Collision Avoidance, ORCA)算法生成。☆71Updated 5 years ago
- 基于MADDPG的多智能体博弈对抗算法☆12Updated last year
- A Deep Learning Approach to UAV Path Planning☆56Updated 6 years ago
- Code for G2RL to solve the multi-robot path planning problem in a fully distributed reactive manner.☆41Updated last year
- gym 框架下的多智能体追逃博弈强化学习平台☆11Updated last year