ymd-h / cpprb
Fast Flexible Replay Buffer Library (Mirror repository of https://gitlab.com/ymd_h/cpprb)
☆72 Updated 7 months ago
Alternatives and similar repositories for cpprb
Users who are interested in cpprb compare it to the libraries listed below.
- Datasets for data-driven deep reinforcement learning with PyBullet environments ☆150 Updated 4 years ago
- A collection of RL algorithms written in JAX. ☆102 Updated 3 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite ☆219 Updated last year
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google) ☆120 Updated 11 months ago
- Convert DeepMind Control Suite to OpenAI gym environments. ☆87 Updated 5 years ago
- Performances of Reinforcement Learning Agents ☆53 Updated 5 years ago
- ☆113 Updated 2 years ago
- impact-driven-exploration ☆131 Updated last year
- ☆30 Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆100 Updated 3 years ago
- ☆200 Updated 2 years ago
- Benchmarking RL generalization in an interpretable way. ☆159 Updated last month
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model ☆152 Updated 4 years ago
- Keeping track of RL experiments ☆162 Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆88 Updated 4 years ago
- ☆306 Updated 7 months ago
- Code for MOPO: Model-based Offline Policy Optimization ☆182 Updated 3 years ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆190 Updated 2 years ago
- PyTorch implementation of distributed deep reinforcement learning ☆76 Updated 3 years ago
- ☆31 Updated 6 years ago
- ☆22 Updated 4 years ago
- Soft Actor-Critic ☆151 Updated 7 years ago
- Hindsight policy gradients ☆45 Updated 5 years ago
- Benchmarking TD3 and DDPG on PyBullet ☆54 Updated 6 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic (SLAC). ☆93 Updated last year
- Accelerated Methods for Deep Reinforcement Learning ☆48 Updated 6 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆205 Updated 2 years ago
- Options of Interest: Temporal Abstraction with Interest Functions (AAAI 2020) ☆25 Updated 5 years ago
- ☆350 Updated 2 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019) ☆69 Updated 2 years ago