Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.
☆100Feb 1, 2020Updated 6 years ago
Alternatives and similar repositories for simple-A2C-PPO
Users that are interested in simple-A2C-PPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆52Feb 4, 2020Updated 6 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆51Oct 30, 2020Updated 5 years ago
- This repository contains tutorial material on Doing DeepRL with PPO in GDG DevFest 2017 Seoul.☆22Nov 20, 2017Updated 8 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆66Jul 13, 2017Updated 8 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,898May 29, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Implementation of MCTS algorithms in Munos (2014)☆13Aug 8, 2018Updated 7 years ago
- Python library for modern thread / multiprocessing pooling and task processing via asyncio☆15Feb 2, 2021Updated 5 years ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning☆35Feb 22, 2021Updated 5 years ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"☆182Apr 2, 2023Updated 3 years ago
- General implementation of Advantage Actor Critic using Pytorch☆28Dec 7, 2021Updated 4 years ago
- OpenAI Gym Environments for the Application of Reinforcement Learning in the Simulation of Wireless Networked Feedback Control Loops☆15Feb 5, 2021Updated 5 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Feb 10, 2019Updated 7 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆65Sep 6, 2023Updated 2 years ago
- Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020☆26Dec 8, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading".☆39Oct 12, 2021Updated 4 years ago
- Lipschitz Lifelong RL☆11Nov 6, 2020Updated 5 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆147Mar 12, 2023Updated 3 years ago
- Basic reinforcement learning implementation with tensorflow version 2.0☆52Apr 20, 2020Updated 6 years ago
- Online courses and books I use for ML & CV, Math, CompSci and others☆14Oct 10, 2018Updated 7 years ago
- ☆11Nov 8, 2019Updated 6 years ago
- ☆12Jul 4, 2022Updated 3 years ago
- Data pipeline for streaming, processing, and analyzing the GDELT global events dataset.☆11Mar 11, 2017Updated 9 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆56Apr 27, 2020Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Jun 9, 2018Updated 7 years ago
- A Deep Reinforcement Learning Network for Traffic Light Cycle Control☆51Mar 22, 2021Updated 5 years ago
- Pytorch Implementation of Proximal Policy Optimization Algorithm☆20Mar 7, 2018Updated 8 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Dec 23, 2016Updated 9 years ago
- ☆20Apr 10, 2018Updated 8 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆107Jun 7, 2019Updated 6 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆51Nov 13, 2018Updated 7 years ago
- RL agent using private and shared world models☆11Jun 12, 2023Updated 2 years ago
- ☆30May 1, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Duel…☆694Dec 18, 2025Updated 4 months ago
- Experiments with transformer based RL algorithms☆22Nov 23, 2019Updated 6 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆338Nov 24, 2021Updated 4 years ago
- CS234 Project, Winter 2019☆10Mar 20, 2019Updated 7 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- Implementing expectimax, alpha-beta pruning, and minimax algorithms in a game of Pacman☆11Jan 17, 2014Updated 12 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆271May 20, 2020Updated 5 years ago