shivamsaboo17 / Policy-Gradient-PyTorch
Implementation of vanilla stochaistic (categorical) policy gradient algorithm to play cartpole.
☆15Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Policy-Gradient-PyTorch
- Hierarchical-DQN in pytorch (not actively maintained)☆68Updated 7 years ago
- A pytorch tutorial for DRL(Deep Reinforcement Learning)☆207Updated last year
- Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…☆65Updated last year
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆96Updated 5 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆82Updated 7 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆133Updated 5 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆174Updated 6 years ago
- ☆118Updated 4 months ago
- pytorch implementation of DQN, NAF, DDPG☆13Updated 6 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆66Updated 4 years ago
- Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.☆199Updated 5 years ago
- Implementation of Hierarchical Deep Q-Learning (Kulkarni et al., 2016)☆34Updated 5 years ago
- Reinforcement Learning | tensorflow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout☆88Updated 6 years ago
- DGN Code☆336Updated last year
- ☆71Updated 5 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.☆60Updated 5 years ago
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆102Updated 2 years ago
- Adversarial Imitation Via Variational Inverse Reinforcement Learning☆95Updated 4 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆131Updated last year
- A toy example of Policy Gradient implemented in Pytorch☆91Updated 6 years ago
- ☆43Updated last year
- The Reinforcement-Learning-Related Papers of ICLR 2019☆48Updated 5 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆368Updated 5 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆258Updated last month
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆96Updated 2 years ago
- Actor Critic model to play Cartpole game☆52Updated 6 years ago
- ☆83Updated 3 years ago
- My solution to assignments in UC Berkeley CS294-112: Deep Reinforcement Learning☆90Updated 5 years ago
- research and implementations of Deep RL agents and their applications☆47Updated 3 weeks ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 6 years ago