qzed / irl-maxentLinks

Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python

☆290

Alternatives and similar repositories for irl-maxent

Users that are interested in irl-maxent are comparing it to the libraries listed below

Sorting:

toshikwa / gail-airl-ppo.pytorch
PyTorch implementation of GAIL and AIRL based on PPO.
☆222Updated 4 years ago
seolhokim / InverseRL-Pytorch
Pytorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation
☆66Updated 4 years ago
ermongroup / MA-AIRL
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
☆210Updated 6 years ago
nishantkr18 / guided-cost-learning
Implementation of the paper https://arxiv.org/abs/1603.00448.
☆37Updated 4 years ago
twni2016 / pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
☆329Updated 11 months ago
sfujim / TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆366Updated 3 years ago
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆317Updated 2 years ago
toshikwa / sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
☆307Updated last year
Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆129Updated 5 months ago
reinforcement-learning-kr / lets-do-irl
Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)
☆760Updated last year
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆318Updated 3 years ago
hcnoh / gail-pytorch
A simple implementation of Generative Adversarial Imitation Learning with PyTorch
☆164Updated 3 years ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆289Updated 4 years ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114Updated last year
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆130Updated last year
BY571 / CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…
☆136Updated last year
liuzuxin / FSRL
🚀 A fast safe reinforcement learning library in PyTorch
☆204Updated 10 months ago
dhruvramani / Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
☆180Updated 2 years ago
openai / safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
☆436Updated 2 years ago
liuzuxin / OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
☆207Updated 10 months ago
yrlu / irl-imitation
Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
☆640Updated last year
CherryPieSexy / imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
☆146Updated 3 years ago
Xingyu-Lin / mbpo_pytorch
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
☆175Updated 3 years ago
semitable / lb-foraging
Level-based Foraging (LBF): A multi-agent environment for RL
☆188Updated 10 months ago
jannerm / mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆504Updated 2 years ago
cyoon1729 / Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆99Updated 6 years ago
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆182Updated last year
Kaixhin / imitation-learning
Imitation learning algorithms
☆543Updated 4 months ago
xtma / dsac
Distributional Soft Actor Critic
☆58Updated 5 years ago
HuangJiaLian / AIRL_MountainCar
Adversarial Inverse Reinforcement Learning Implement For Mountain Car
☆36Updated 3 years ago