MeepMoop / tilecoding
Binary feature representations with tile coding
☆45 Updated 7 months ago
Alternatives and similar repositories for tilecoding:
Users who are interested in tilecoding compare it to the libraries listed below:
- A repository of high-performing hierarchical reinforcement learning models and algorithms. ☆300 Updated 2 years ago
- Keeping track of RL experiments ☆162 Updated 2 years ago
- ☆194 Updated 2 years ago
- DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details ☆46 Updated 3 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm. ☆259 Updated 4 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments ☆312 Updated 3 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆355 Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆126 Updated 8 months ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms ☆152 Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆167 Updated 3 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆176 Updated 9 months ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆133 Updated 9 months ago
- PyTorch implementation of SAC-Discrete. ☆304 Updated 9 months ago
- Constrained Policy Optimization ☆318 Updated 7 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆489 Updated 2 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction ☆160 Updated 4 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment ☆105 Updated 5 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration ☆57 Updated 3 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning' ☆63 Updated 3 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020) ☆187 Updated 2 years ago
- TensorFlow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir… ☆126 Updated 5 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆166 Updated 5 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆135 Updated 11 months ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆188 Updated 2 years ago
- ☆299 Updated 4 months ago
- Code for the paper "Phasic Policy Gradient" ☆261 Updated 2 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆139 Updated 6 years ago
- There will be updates later ☆84 Updated 5 years ago
- Gridworld for MARL experiments ☆139 Updated 4 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆446 Updated last year