deepeshmittal / AI-PacMan-Projects
Implemented various AI algorithms in Pac-Man projects developed by UC Berkeley. Implemented informed/blind state-space search using search algorithms like BFS, DFS, UCS and A* algorithm with heuristic calculation. Designed an algorithm for reflex agent, minimax and alpha-beta pruning. Reinforcement Learning using MDP (Value Iteration and Policy …
☆22Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for AI-PacMan-Projects
- Implementation of Scheduled Policy Optimization for task-oriented language grouding☆29Updated 6 years ago
- LSTM-based model for answer selection☆19Updated 5 years ago
- This is implementation of the paper 'Toward Diverse Text Generation with Inverse Reinforcement Learning' https://arxiv.org/abs/1804.11258…☆34Updated 5 years ago
- Deep Reinforcement Learning for Multi Agent Soccer☆17Updated 7 years ago
- Implementations of Reinforcement Learning Algorithm☆40Updated 6 years ago
- PyTorch port of the paper "Language to Logical Form with Neural Attention"☆38Updated 4 years ago
- Assignments for CS294-112 Deep Reinforcement Learning in UC Berkeley in Fall 2018☆16Updated 5 years ago
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>☆89Updated 6 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆46Updated 3 years ago
- Tensorflow implementation of DCN for question answering on the Stanford Question Answering Dataset (SQuAD)☆14Updated 6 years ago
- Replication of the paper "Structured Neural Summarization" which uses Graph Neural Networks and Seq2Seq models to summarize natural langu…☆21Updated 5 years ago
- ☆11Updated last year
- Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)☆74Updated 4 years ago
- Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning☆151Updated 6 years ago
- Implementation of value iteration algorithm for calculating an optimal MDP policy☆93Updated 6 years ago
- Reimplementation of NLP Style Transfer from Non-parallel Text with Adversarial Alignment (https://arxiv.org/abs/1705.09655)☆14Updated 3 years ago
- 课程笔记,David Silver,CS294 ...☆15Updated 5 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input.☆30Updated 7 years ago
- An implementation of N-aryTreeLSTM in pyTorch and mxNet☆8Updated 3 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework☆36Updated 6 years ago
- Article sample preview(Not yet finished)☆9Updated 5 years ago
- ☆14Updated 7 years ago
- Implementation of Adversarial Ranking for Language Generation [ArxiV 1705.11001]☆31Updated 6 years ago
- hierarchical Q-learning implementation☆11Updated 9 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- This is the code of "Adversarial Connective-exploiting Networks for Implicit Discourse Relation Classification" in ACL 2017☆19Updated 6 years ago
- TensorFlow implementation of Pointer Networks, modified to use a threshold (or hardmax) pointer instead of a softmax pointer.☆40Updated 7 years ago
- An End-to-End Deep Framework for Answer Triggering with a Novel Group-Level Objective (EMNLP17)☆9Updated 5 years ago
- classification of the imdb large movie review dataset☆36Updated 5 years ago
- reinforcement learning for tsp by pointer network☆28Updated 7 years ago