uidilr / deepirl_chainer
Implementation of GAIL and AIRL using chinerrl
☆16Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for deepirl_chainer
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆26Updated 5 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation☆21Updated 6 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆36Updated 2 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆21Updated 5 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 5 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆34Updated 2 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆28Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆66Updated 4 years ago
- High-quality reference implementations of various algorithms for Inverse Reinforcement Learning☆13Updated 6 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Updated 3 years ago
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Updated 5 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆70Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆36Updated 3 weeks ago
- Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)☆19Updated 4 years ago
- Comp 781 Project☆8Updated 5 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆42Updated 4 years ago
- Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018)☆13Updated 6 years ago
- The implementation of Discriminator Soft Actor Critic☆14Updated 4 years ago
- [ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"☆46Updated 5 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 5 years ago
- ☆17Updated 3 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- ☆16Updated 3 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆87Updated 3 months ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- Train agents on MiniGrid from human demonstrations using Inverse Reinforcement Learning☆14Updated 4 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆30Updated 3 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated last year