thiagov / pacman-ai
A Monte Carlo Tree Search Agent used to control agents in a Pacman competition.
☆16Updated 10 years ago
Alternatives and similar repositories for pacman-ai:
Users that are interested in pacman-ai are comparing it to the libraries listed below
- Monte Carlo tree search (MCTS) on traveling salesman problem (TSP)☆22Updated 6 years ago
- a pacman AI with a reinforcement learning agent that utilizes value iteration, policy iteration, policy extraction, Q-learning.☆24Updated 12 years ago
- Implementation of value iteration algorithm for calculating an optimal MDP policy☆97Updated 7 years ago
- Demo of UCT (MCTS) in Python / Numpy☆85Updated 2 years ago
- Multiagent reinforcement learning simulation framework - Undergraduate thesis in Mechatronics Engineering at the University of Brasília☆67Updated 6 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆84Updated 6 years ago
- Tensorflow implementation of Deep Deterministic Policy Gradients☆19Updated 8 years ago
- Unofficial PyBrain extension for multi-agent reinforcement learning in general sum stochastic games.☆69Updated 9 years ago
- Bin Packing Problem using Neural Combinatorial Optimization.☆107Updated 2 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients☆38Updated 7 years ago
- General Purpose C++ Implementation for Inference and Learning in Bayesian and Markov Networks☆16Updated 6 years ago
- Half Field Offense in Robocup 2D Soccer☆232Updated 2 years ago
- 多智能体即时策略对抗方法与实践 苏炯铭 刘鸿福 陈少飞 项凤涛 编著 科学出版社 2019.11 随书代码☆32Updated 4 years ago
- reinforcement learning ddpg code. follow deepmind papers.☆60Updated 7 years ago
- Multi-Objective Deep Reinforcement Learning☆43Updated 8 years ago
- Deep Reinforcement Learning in Pac-man☆283Updated last year
- pytorch neural combinatorial optimization☆383Updated 7 years ago
- Proximal Policy Optimization with Tensorflow 2.0☆31Updated 5 years ago
- Reinforcement Learning using Policy Gradient to solve OpenAI Gym games☆113Updated 7 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆25Updated 8 years ago
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Updated 7 years ago
- Tensorflow implementation of an Actor Critic algorithm using a Pointer Network to solve the TSP (algorithm from Neural Combinatorial Opti…☆44Updated 7 years ago
- A reimplementation of the Google AlphaZero algorithm.☆18Updated 4 years ago
- An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros.☆71Updated 4 years ago
- ☆76Updated 7 years ago
- A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow☆15Updated 6 years ago
- ☆42Updated this week
- SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis☆30Updated 5 years ago
- Solving POMDP using Recurrent networks☆86Updated 4 years ago