deepeshmittal / AI-PacMan-Projects
Implemented various AI algorithms in Pac-Man projects developed by UC Berkeley. Implemented informed/blind state-space search using search algorithms like BFS, DFS, UCS and A* algorithm with heuristic calculation. Designed an algorithm for reflex agent, minimax and alpha-beta pruning. Reinforcement Learning using MDP (Value Iteration and Policy …
☆22Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for AI-PacMan-Projects
- Assignments for CS294-112 Deep Reinforcement Learning in UC Berkeley in Fall 2018☆16Updated 6 years ago
- 课程笔记,David Silver,CS294 ...☆15Updated 5 years ago
- Implementation of Scheduled Policy Optimization for task-oriented language grouding☆29Updated 6 years ago
- Deep Reinforcement Learning for Multi Agent Soccer☆17Updated 7 years ago
- Reinforcement learning of driving a racing car in TORCS using DDPG algorithm☆14Updated 6 years ago
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>☆89Updated 6 years ago
- Implementation of value iteration algorithm for calculating an optimal MDP policy☆93Updated 6 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework☆36Updated 6 years ago
- LSTM-based model for answer selection☆19Updated 6 years ago
- Reinforcement Learning For Dialogue Systems 强化学习在对话系统中的应用 论文或开源应用总结☆28Updated 4 years ago
- Collaborative Deep Reinforcement Learning☆32Updated 7 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆151Updated last year
- hierarchical Q-learning implementation☆11Updated 9 years ago
- This is implementation of the paper 'Toward Diverse Text Generation with Inverse Reinforcement Learning' https://arxiv.org/abs/1804.11258…☆34Updated 5 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Updated 5 years ago
- Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning☆151Updated 6 years ago
- Autonomous vehicle learn how to navigate efficiently at crossroad☆16Updated 6 years ago
- Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)☆74Updated 4 years ago
- This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on Youtube☆62Updated 7 years ago
- ☆53Updated 7 years ago
- Autonomous Driving on Carla simulator using Deep Deterministic Policy Gradients. Based on Kendall, et. al. 2018.☆11Updated 5 years ago
- Article sample preview(Not yet finished)☆9Updated 5 years ago
- A deep reinforcement learning approach to search engine ranking (PyTorch). Final Project for UC Berkeley's CS 285: Deep Reinforcement Lea…☆25Updated 6 months ago
- Sorting numbers with pointer networks☆56Updated 6 years ago
- ☆11Updated last month
- Deep Learning Experiments Motivated from Fastai Course☆14Updated 5 years ago
- ☆11Updated 5 years ago
- TensorFlow implementation of Pointer Networks, modified to use a threshold (or hardmax) pointer instead of a softmax pointer.☆40Updated 7 years ago
- ☆11Updated last year