chscheller / minerl_agentLinks
3rd placed submission to the NeurIPS MineRL competition 2019
☆10Updated 2 years ago
Alternatives and similar repositories for minerl_agent
Users that are interested in minerl_agent are comparing it to the libraries listed below
Sorting:
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 5 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Updated 5 years ago
- Deep RL agents with PyTorch☆36Updated 4 years ago
- Docker containers of baseline agents for the Crafter environment☆30Updated 4 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆19Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 6 months ago
- ☆48Updated 2 months ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆79Updated 3 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆33Updated 2 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38Updated 2 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Updated last year
- ☆23Updated last year
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆94Updated last year
- AGAC: Adversarially Guided Actor-Critic☆47Updated 4 years ago
- ForgER algorithm☆23Updated 3 years ago
- ☆26Updated 2 years ago
- V-MPO torch version with DMLab30 and GTrXL☆13Updated 4 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆53Updated 8 months ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30Updated 5 years ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆40Updated last year
- ☆31Updated 6 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Updated last year
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 6 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551 ) with PyTorch☆47Updated 5 years ago
- ☆18Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆83Updated 3 years ago
- Advantage weighted Actor Critic for Offline RL☆52Updated 3 years ago
- Model-Based Offline Reinforcement Learning☆52Updated 5 years ago