Farama-Foundation / D4RL
A collection of reference environments for offline reinforcement learning
☆1,348Updated this week
Related projects ⓘ
Alternatives and complementary repositories for D4RL
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…☆1,222Updated 11 months ago
- Library for Model Based RL☆963Updated 4 months ago
- PyTorch implementation of soft actor critic☆816Updated 3 years ago
- Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning☆1,275Updated 2 weeks ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆1,130Updated 11 months ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆515Updated 2 years ago
- Code for conservative Q-learning☆410Updated 2 years ago
- The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization☆649Updated 7 months ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆598Updated 3 years ago
- ☆1,090Updated 5 months ago
- An offline deep reinforcement learning library☆1,327Updated this week
- A curated list of awesome imitation learning resources and publications☆526Updated 9 months ago
- An index of algorithms for offline reinforcement learning (offline-rl)☆929Updated 5 months ago
- Soft Actor-Critic☆1,006Updated 11 months ago
- Imitation learning algorithms☆466Updated 3 months ago
- Clean PyTorch implementations of imitation and reward learning algorithms☆1,327Updated 3 months ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆472Updated last year
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆654Updated 6 months ago
- Author's PyTorch implementation of TD3 for OpenAI gym tasks☆1,717Updated last year
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…☆1,107Updated 3 years ago
- Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms☆286Updated last year
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)☆710Updated 10 months ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆479Updated last year
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.☆772Updated 3 months ago
- Collection of reinforcement learning algorithms☆2,504Updated 5 months ago
- Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code☆504Updated this week
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.☆631Updated 2 years ago
- A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.☆1,133Updated 2 years ago
- A collection of multi agent environments based on OpenAI gym.☆574Updated 4 months ago
- Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.☆830Updated 3 years ago