hanjuku-kaso / awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
☆1,040 Updated last year
Alternatives and similar repositories for awesome-offline-rl
Users who are interested in awesome-offline-rl are comparing it to the repositories listed below
- Author's PyTorch implementation of BCQ for continuous and discrete actions ☆648 Updated 4 years ago
- Code for conservative Q-learning ☆462 Updated 3 years ago
- An offline deep reinforcement learning library ☆1,586 Updated 2 months ago
- A collection of reference environments for offline reinforcement learning ☆1,609 Updated last year
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games ☆557 Updated 2 years ago
- An elegant PyTorch offline reinforcement learning library for researchers. ☆367 Updated 4 months ago
- Library for Model Based RL ☆1,030 Updated last year
- Imitation learning algorithms ☆554 Updated 7 months ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem" ☆518 Updated 3 years ago
- PyTorch implementation of soft actor critic ☆921 Updated 4 months ago
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL) ☆763 Updated last year
- A curated list of awesome imitation learning resources and publications ☆597 Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆376 Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC) ☆569 Updated 3 years ago
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces. ☆728 Updated 3 years ago
- A curated list of awesome Deep Reinforcement Learning resources. ☆829 Updated 4 months ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT… ☆1,308 Updated 8 months ago
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds. ☆852 Updated last year
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear… ☆1,257 Updated 4 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open… ☆283 Updated 3 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆515 Updated 2 years ago
- Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods. ☆971 Updated 5 months ago
- The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization ☆876 Updated last year
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp… ☆1,364 Updated last year
- The repository is for safe reinforcement learning baselines. ☆723 Updated 3 weeks ago
- ☆315 Updated 3 years ago
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC… ☆1,275 Updated 2 years ago
- ☆1,281 Updated last year
- Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019 ☆714 Updated 2 years ago
- PFRL: a PyTorch-based deep reinforcement learning library ☆1,250 Updated last year