xtma / simple-pytorch-rl
Reinforcement Learning Methods with PyTorch
☆37 Updated 4 years ago
Related projects:
- Hierarchical-DQN in PyTorch (not actively maintained) ☆65 Updated 7 years ago
- Soft Actor-Critic ☆142 Updated 6 years ago
- ☆90 Updated 9 months ago
- ☆115 Updated last month
- Curiosity-driven Exploration by Self-supervised Prediction ☆131 Updated last year
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction ☆158 Updated 4 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆52 Updated 4 years ago
- A repository for code of reinforcement learning algorithms with PyTorch ☆29 Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆113 Updated last month
- GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options ☆42 Updated 2 years ago
- This repository contains the implementation for the paper "Exploration via Hierarchical Meta Reinforcement Learning". ☆60 Updated 5 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration ☆26 Updated 2 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS 2019) ☆59 Updated 4 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020) ☆102 Updated 2 years ago
- Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning ☆89 Updated last year
- [ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration" ☆45 Updated 5 years ago
- ☆53 Updated 6 months ago
- ☆41 Updated 5 years ago
- ☆59 Updated 6 years ago
- Adaptive Attention Span for Reinforcement Learning ☆130 Updated 4 years ago
- ☆44 Updated last year
- Model-Based Offline Reinforcement Learning ☆47 Updated 3 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables ☆69 Updated last year
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas. ☆88 Updated 2 weeks ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning ☆63 Updated 4 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆169 Updated 2 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy" ☆33 Updated 5 years ago
- Offline Reinforcement Learning Reading Group ☆24 Updated last year
- Disagreement-Regularized Imitation Learning ☆30 Updated 3 years ago
- ☆85 Updated last month