MeepMoop / tilecoding
Binary feature representations with tile coding
☆45Updated 8 months ago
Alternatives and similar repositories for tilecoding
Users that are interested in tilecoding are comparing it to the libraries listed below
Sorting:
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆259Updated 4 years ago
- PyTorch implementation of SAC-Discrete.☆302Updated 9 months ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆315Updated 3 years ago
- ☆195Updated 2 years ago
- Constrained Policy Optimization☆320Updated 7 years ago
- Keeping track of RL experiments☆161Updated 2 years ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆306Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 9 months ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆200Updated 2 years ago
- A customizable framework to create maze and gridworld environments☆265Updated 6 years ago
- DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details☆46Updated 3 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆96Updated 4 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆167Updated 3 years ago
- Implementation of the Option-Critic Architecture on the Atari (ALE) environment☆177Updated 7 years ago
- ☆300Updated 4 months ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆140Updated 6 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆176Updated 9 months ago
- Implementation of Soft Actor Critic☆37Updated 3 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆125Updated 5 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆142Updated last year
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆116Updated 2 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆128Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆542Updated 3 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'☆63Updated 3 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆491Updated 2 years ago
- ☆75Updated 5 years ago
- Random Network Distillation pytorch☆247Updated 6 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆128Updated 9 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆359Updated 3 years ago
- An engine to create high performance multi-agent grid world environments with hundreds or thousands of agents, along with a set of refere…☆191Updated 2 years ago