mazpie / vime-pytorch
PyTorch implementation of the VIME paper (Variational Information Maximizing Exploration)
☆7Updated 2 years ago
Alternatives and similar repositories for vime-pytorch
Users that are interested in vime-pytorch are comparing it to the libraries listed below
Sorting:
- CORRO code☆35Updated 2 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆35Updated 2 years ago
- Simple maze environments using mujoco-py☆54Updated last year
- ☆53Updated last year
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆17Updated 4 years ago
- ☆49Updated 3 years ago
- ☆31Updated 4 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆55Updated 2 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 3 years ago
- ☆14Updated 3 years ago
- ☆55Updated 2 years ago
- Conservative Q Learning on top of SAC☆130Updated 2 years ago
- ☆29Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆28Updated 2 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Updated 4 years ago
- ☆29Updated 3 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 6 years ago
- ☆42Updated 2 years ago
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch☆33Updated 4 years ago
- PyTorch IMPALA implementation☆26Updated 5 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆19Updated 3 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated 2 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Updated 3 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆75Updated 2 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Updated last year
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆52Updated 3 years ago
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆43Updated last month
- Asymmetric methods for partially observable reinforcement learning☆10Updated last month
- ☆15Updated last year