hanjuku-kaso / awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
☆1,007 Updated last year
Alternatives and similar repositories for awesome-offline-rl
Users who are interested in awesome-offline-rl are comparing it to the libraries listed below
- A collection of reference environments for offline reinforcement learning ☆1,545 Updated 8 months ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions ☆638 Updated 4 years ago
- An offline deep reinforcement learning library ☆1,528 Updated 2 months ago
- Code for conservative Q-learning ☆451 Updated 3 years ago
- An elegant PyTorch offline reinforcement learning library for researchers. ☆353 Updated last month
- Imitation learning algorithms ☆545 Updated 4 months ago
- Library for Model Based RL ☆1,011 Updated last year
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL) ☆760 Updated last year
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games ☆553 Updated 2 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆367 Updated 3 years ago
- The repository is for safe reinforcement learning baselines. ☆680 Updated 2 weeks ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem" ☆510 Updated 2 years ago
- A curated list of awesome imitation learning resources and publications ☆582 Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC) ☆555 Updated 3 years ago
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds. ☆836 Updated 11 months ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT… ☆1,267 Updated 4 months ago
- The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization ☆813 Updated last year
- PyTorch implementation of soft actor critic ☆896 Updated 3 weeks ago
- Clean PyTorch implementations of imitation and reward learning algorithms ☆1,559 Updated 7 months ago
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark ☆480 Updated 5 months ago
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear… ☆1,237 Updated 4 years ago
- PFRL: a PyTorch-based deep reinforcement learning library ☆1,242 Updated last year
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆507 Updated 2 years ago
- High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC… ☆1,237 Updated 2 years ago
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces. ☆704 Updated 2 years ago
- Deep RL algorithm implementations that are easy to read and understand, in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DD… ☆335 Updated 2 years ago
- Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning ☆1,534 Updated last week
- ☆1,214 Updated last year
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL) ☆499 Updated 2 years ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open… ☆278 Updated 3 years ago