andy-psai / MountainCar_ActorCritic
TD Advantage Actor-Critic RL algorithm
☆15Updated 6 years ago
Alternatives and similar repositories for MountainCar_ActorCritic
Users that are interested in MountainCar_ActorCritic are comparing it to the libraries listed below
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆268Updated 5 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆127Updated 5 years ago
- Implementation of Soft Actor-Critic (SAC) algorithm using TensorFlow 2.1.0☆12Updated 5 years ago
- A basic 2D maze environment where an agent starts from the top left corner and tries to find its way to the bottom left corner.☆375Updated 2 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆357Updated 5 years ago
- Tools for accelerating safe exploration research.☆574Updated 2 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆527Updated 3 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆106Updated 6 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆332Updated 4 years ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆331Updated 2 years ago
- This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.☆439Updated 4 years ago
- OpenAI gym-based algorithm for the grid world problem☆28Updated 5 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆294Updated 4 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆176Updated 3 years ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆452Updated 2 years ago
- ☆92Updated 5 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆100Updated 6 years ago
- Lightweight multi-agent gridworld Gym environment☆212Updated 2 years ago
- An engine to create high performance multi-agent grid world environments with hundreds or thousands of agents, along with a set of refere…☆194Updated 3 years ago
- Simple Cartpole example written with PyTorch.☆167Updated 6 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)☆148Updated 6 years ago
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)☆764Updated 2 years ago
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping☆301Updated last year
- Proximal Policy Optimization implementation with TensorFlow☆108Updated 7 years ago
- Binary feature representations with tile coding☆46Updated last year
- ☆275Updated 7 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆98Updated 5 years ago
- PyTorch implementation of SAC-Discrete.☆313Updated last year
- List of competitions related to Reinforcement Learning☆350Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆579Updated 4 years ago