Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
☆371Aug 1, 2019Updated 6 years ago
Alternatives and similar repositories for pg_travel
Users that are interested in pg_travel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)☆782Dec 22, 2023Updated 2 years ago
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…☆1,286Feb 9, 2021Updated 5 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,900May 29, 2022Updated 4 years ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment☆15Jul 1, 2018Updated 7 years ago
- Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch☆1,079May 19, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- implementation of distributed reinforcement learning with distributed tensorflow☆57Jun 5, 2021Updated 5 years ago
- The state-of-art deep rl algorithms for Montezuma's revenge☆28Oct 28, 2018Updated 7 years ago
- PyTorch implementation of deep reinforcement learning algorithms☆487Nov 19, 2021Updated 4 years ago
- Collection of reinforcement learning algorithms☆2,906Jun 17, 2024Updated 2 years ago
- ☆21Feb 22, 2020Updated 6 years ago
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization☆55Dec 20, 2018Updated 7 years ago
- Reinforcement Learning in PyTorch☆2,280Jan 4, 2021Updated 5 years ago
- Proximal Policy Optimization implementation with TensorFlow☆108Oct 9, 2018Updated 7 years ago
- ☆49Apr 15, 2019Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- PyTorch implementation of Trust Region Policy Optimization☆448Sep 13, 2018Updated 7 years ago
- This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Duel…☆696Dec 18, 2025Updated 6 months ago
- Repository for slides & codes of RL Korea Bootcamp☆41Oct 28, 2019Updated 6 years ago
- ☆57Mar 27, 2019Updated 7 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆363Jun 2, 2020Updated 6 years ago
- Sawyer environments for reinforcement learning using the OpenAI Gym interface (EXPERIMENTAL)☆37Dec 11, 2019Updated 6 years ago
- PyTorch implementation of soft actor critic☆944Jul 17, 2025Updated 11 months ago
- Structural implementation of RL key algorithms☆517Apr 8, 2023Updated 3 years ago
- Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)☆3,206Apr 22, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆377Oct 15, 2021Updated 4 years ago
- A repository for implementation of deep reinforcement learning lectured at Samsung☆110Sep 20, 2021Updated 4 years ago
- OpenAI Baselines: high-quality implementations of reinforcement learning algorithms☆16,734Aug 1, 2024Updated last year
- Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch☆883Dec 27, 2022Updated 3 years ago
- weekly reinforcement learning paper reviews☆33Jan 8, 2018Updated 8 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,328Sep 25, 2019Updated 6 years ago
- Rainbow: Combining Improvements in Deep Reinforcement Learning☆1,671Jan 13, 2022Updated 4 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆67Dec 30, 2019Updated 6 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Jun 18, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Author's PyTorch implementation of TD3 for OpenAI gym tasks☆2,086Jul 14, 2023Updated 2 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- Beat The Bots Source Code☆13Nov 21, 2019Updated 6 years ago
- Minimal version of DeepMind AlphaZero☆85Dec 11, 2020Updated 5 years ago
- PyTorch implementation of Proximal Policy Optimization☆53Dec 20, 2017Updated 8 years ago
- [파이썬과 케라스로 배우는 강화학습] 예제☆387Oct 28, 2020Updated 5 years ago
- Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL☆3,178Nov 4, 2021Updated 4 years ago