thiagov / pacman-aiLinks
A Monte Carlo Tree Search Agent used to control agents in a Pacman competition.
☆16Updated 10 years ago
Alternatives and similar repositories for pacman-ai
Users that are interested in pacman-ai are comparing it to the libraries listed below
Sorting:
- Deep Reinforcement Learning in Pac-man☆284Updated last year
- a pacman AI with a reinforcement learning agent that utilizes value iteration, policy iteration, policy extraction, Q-learning.☆24Updated 12 years ago
- Demo of UCT (MCTS) in Python / Numpy☆86Updated 2 years ago
- Simple Game with Monte Carlo Tree Search☆58Updated 7 years ago
- Monte Carlo tree search (MCTS) on traveling salesman problem (TSP)☆22Updated 6 years ago
- Single Player Monte Carlo Tree Search implementation☆18Updated 5 years ago
- Implementation of value iteration algorithm for calculating an optimal MDP policy☆97Updated 7 years ago
- Board game AI implementations using Monte Carlo Tree Search☆184Updated 5 years ago
- Hybrid Reward Architecture☆77Updated 7 years ago
- Python implementation of POMDP framework and PBVI & POMCP algorithms.☆112Updated 3 years ago
- POMDPs in Python.☆248Updated 6 years ago
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆88Updated 8 months ago
- Simple grid-world environment compatible with OpenAI-gym☆50Updated 5 years ago
- DQN to play Atari Pong☆114Updated 6 years ago
- An implementation of Monte Carlo Tree Search in python☆162Updated 4 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆65Updated 7 years ago
- Old and new Reinforcement Learning algorithms run on the GridUniverse ecosystem☆23Updated 6 years ago
- An Python N-in-Row game based on Monte Carlo Tree Search and UCT RAVE☆51Updated 7 years ago
- The original implementation of HyperNEAT in C++☆28Updated 9 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients☆38Updated 7 years ago
- Value & Policy Iteration for the frozenlake environment of OpenAI☆15Updated 6 years ago
- Maximum Entropy Inverse Reinforcement Learning for predicting pedestrian movement☆9Updated 7 years ago
- OpenAI gym-based algorithm for the grid world problem☆28Updated 4 years ago
- ☆18Updated 6 years ago
- Training a vision-based agent with the Deep Q Learning Network (DQN) in Atari's Breakout environment, implementation in Tensorflow.☆16Updated 6 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.☆66Updated 7 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Updated 6 years ago
- Value iteration, policy iteration, and Q-Learning in a grid-world MDP.☆29Updated last year
- A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.☆370Updated last year
- Use Asynchronous advantage actor-critic algorithm (A3C) to play Flappy Bird using Keras☆39Updated 7 years ago