Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
☆373Aug 1, 2019Updated 6 years ago
Alternatives and similar repositories for pg_travel
Users that are interested in pg_travel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)☆777Dec 22, 2023Updated 2 years ago
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…☆1,273Feb 9, 2021Updated 5 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,886May 29, 2022Updated 3 years ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment☆16Jul 1, 2018Updated 7 years ago
- Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch☆1,080May 19, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- implementation of distributed reinforcement learning with distributed tensorflow☆57Jun 5, 2021Updated 4 years ago
- The state-of-art deep rl algorithms for Montezuma's revenge☆28Oct 28, 2018Updated 7 years ago
- PyTorch implementation of deep reinforcement learning algorithms☆490Nov 19, 2021Updated 4 years ago
- Collection of reinforcement learning algorithms☆2,888Jun 17, 2024Updated last year
- ☆21Feb 22, 2020Updated 6 years ago
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization☆55Dec 20, 2018Updated 7 years ago
- Reinforcement Learning in PyTorch☆2,273Jan 4, 2021Updated 5 years ago
- Proximal Policy Optimization implementation with TensorFlow☆108Oct 9, 2018Updated 7 years ago
- ☆49Apr 15, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- PyTorch implementation of Trust Region Policy Optimization☆450Sep 13, 2018Updated 7 years ago
- This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Duel…☆693Dec 18, 2025Updated 3 months ago
- Repository for slides & codes of RL Korea Bootcamp☆41Oct 28, 2019Updated 6 years ago
- ☆57Mar 27, 2019Updated 7 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆362Jun 2, 2020Updated 5 years ago
- Sawyer environments for reinforcement learning using the OpenAI Gym interface (EXPERIMENTAL)☆37Dec 11, 2019Updated 6 years ago
- PyTorch implementation of soft actor critic☆940Jul 17, 2025Updated 8 months ago
- Structural implementation of RL key algorithms☆516Apr 8, 2023Updated 2 years ago
- Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)☆3,170Apr 22, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆375Oct 15, 2021Updated 4 years ago
- A repository for implementation of deep reinforcement learning lectured at Samsung☆110Sep 20, 2021Updated 4 years ago
- OpenAI Baselines: high-quality implementations of reinforcement learning algorithms☆16,680Aug 1, 2024Updated last year
- Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch☆877Dec 27, 2022Updated 3 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,317Sep 25, 2019Updated 6 years ago
- weekly reinforcement learning paper reviews☆33Jan 8, 2018Updated 8 years ago
- Rainbow: Combining Improvements in Deep Reinforcement Learning☆1,663Jan 13, 2022Updated 4 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆67Dec 30, 2019Updated 6 years ago
- Author's PyTorch implementation of TD3 for OpenAI gym tasks☆2,051Jul 14, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Jun 18, 2019Updated 6 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- Beat The Bots Source Code☆13Nov 21, 2019Updated 6 years ago
- Minimal version of DeepMind AlphaZero☆85Dec 11, 2020Updated 5 years ago
- PyTorch implementation of Proximal Policy Optimization☆53Dec 20, 2017Updated 8 years ago
- [파이썬과 케라스로 배우는 강화학습] 예제☆386Oct 28, 2020Updated 5 years ago
- Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL☆3,169Nov 4, 2021Updated 4 years ago