MorvanZhou / mmaze
A python maze generator and solver
☆18Updated last year
Related projects ⓘ
Alternatives and complementary repositories for mmaze
- path finding algorithms☆18Updated 7 months ago
- A* (A-Star) algorithm for finding the shortest path in a maze☆15Updated 3 years ago
- Evolutionary algorithms, alternative to Reinforcement Learning☆36Updated last year
- 这个仓库用于存储一些强化学习练手小项目与算法实验。具体来讲,就是不至于单独成一个 repo 的项目,但是又值得拿出来讨论的代码。☆16Updated 3 years ago
- ☆16Updated 3 years ago
- Pytorch implementation of Randomized Ensembled Double Q-learning (REDQ)☆21Updated 3 years ago
- ☆23Updated last year
- 天授中文文档☆55Updated 2 years ago
- 主要存储Datawhale组队学习中“强化学习”方向的资料。☆32Updated 4 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆45Updated last year
- ☆10Updated 3 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆36Updated last year
- Framework to build and train RL algorithms☆36Updated 3 years ago
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- ☆28Updated last year
- Graph convolutional memory for reinforcement learning☆20Updated 3 years ago
- [ICLR 2021] Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments.☆56Updated last year
- Code for the paper “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning”☆23Updated last year
- ☆35Updated 4 years ago
- This project applies Monte Carlo Tree Search (MCTS) to a simple grid world.☆10Updated 6 years ago
- RL Algorithms☆13Updated last year
- Decision Transformer: A brand new Offline RL Pattern.☆34Updated 2 years ago
- Using (deep) reinforcement_learning algorithm to practice on OpenAI Gym, Unity ML-Agents,and other virtual environments. Using Python ,Py…☆15Updated 4 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020☆50Updated 4 months ago
- Reinforcement Learning and Transfer Learning based StarCraft Micromanagement☆45Updated 7 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆25Updated last year
- ☆16Updated 2 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆26Updated 3 years ago
- very easy implementation of dueling DQN in pytorch☆69Updated last year