hanjuku-kaso / awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
☆989 · Updated last year
Alternatives and similar repositories for awesome-offline-rl
Users interested in awesome-offline-rl are comparing it to the repositories listed below.
- A collection of reference environments for offline reinforcement learning ☆1,510 · Updated 7 months ago
- An offline deep reinforcement learning library ☆1,497 · Updated last month
- Author's PyTorch implementation of BCQ for continuous and discrete actions ☆632 · Updated 4 years ago
- Code for conservative Q-learning ☆446 · Updated 3 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games ☆551 · Updated 2 years ago
- Imitation learning algorithms ☆538 · Updated 3 months ago
- An elegant PyTorch offline reinforcement learning library for researchers. ☆345 · Updated last year
- Library for Model Based RL ☆1,003 · Updated 11 months ago
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds. ☆834 · Updated 10 months ago
- A curated list of awesome imitation learning resources and publications ☆579 · Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC) ☆552 · Updated 3 years ago
- ☆1,197 · Updated last year
- PyTorch implementation of soft actor critic ☆885 · Updated 3 years ago
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces. ☆686 · Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT… ☆1,256 · Updated 3 months ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem" ☆502 · Updated 2 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆500 · Updated 2 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆361 · Updated 3 years ago
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp… ☆1,310 · Updated last year
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL) ☆755 · Updated last year
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC… ☆1,223 · Updated last year
- Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019 ☆680 · Updated 2 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL) ☆494 · Updated 2 years ago
- The repository is for safe reinforcement learning baselines. ☆654 · Updated 2 months ago
- A curated list of awesome model based RL resources (continually updated) ☆1,140 · Updated last month
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear… ☆1,225 · Updated 4 years ago
- Clean PyTorch implementations of imitation and reward learning algorithms ☆1,524 · Updated 5 months ago
- Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods. ☆938 · Updated 3 weeks ago
- Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms ☆301 · Updated 2 years ago
- A curated list of Decision Transformer resources (continually updated) ☆799 · Updated 4 months ago