microsoft / FQF
FQF (Fully parameterized Quantile Function for distributional reinforcement learning) is a reinforcement learning framework for Atari games. It learns to play automatically by predicting the return distribution in the form of a fully parameterized quantile function.
☆42 · Updated 4 years ago
Alternatives and similar repositories for FQF:
Users who are interested in FQF are comparing it to the libraries listed below.
- PyTorch implementation of distributed deep reinforcement learning ☆76 · Updated 2 years ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments ☆149 · Updated 4 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF… ☆31 · Updated 4 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019) ☆70 · Updated last year
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning ☆124 · Updated 4 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆96 · Updated 4 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic (SLAC). ☆89 · Updated 9 months ago
- Revisiting Rainbow ☆74 · Updated 3 years ago
- Soft Actor-Critic with advanced features ☆49 · Updated this week
- Soft Actor-Critic ☆144 · Updated 7 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆176 · Updated 9 months ago
- Keeping track of RL experiments ☆162 · Updated 2 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire… ☆67 · Updated 4 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR 2019) ☆66 · Updated 5 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆133 · Updated 9 months ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆103 · Updated last month
- PyTorch implementation of Dreamer (Model-based Image RL Algorithm) ☆168 · Updated 3 months ago
- Implementation of the Option-Critic Architecture ☆39 · Updated 6 years ago
- A TensorFlow implementation of the Option-Critic Architecture ☆72 · Updated 7 years ago
- Disagreement-Regularized Imitation Learning ☆30 · Updated 3 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆55 · Updated last month
- Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR 2021) ☆51 · Updated 3 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model ☆150 · Updated 4 years ago
- Hindsight policy gradients ☆45 · Updated 5 years ago
- A simple framework for distributed reinforcement learning in PyTorch. ☆16 · Updated 5 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆68 · Updated 3 years ago