abhayraw1 / planet-torchLinks
A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning
☆12Updated 5 years ago
Alternatives and similar repositories for planet-torch
Users that are interested in planet-torch are comparing it to the libraries listed below
Sorting:
- Implementation of Proximal Policy Optimization in Jax+Flax☆20Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Fast reinforcement learning research☆61Updated 10 months ago
- ☆45Updated last year
- Simplistic Pytorch Implementation of the Dreamer-RL☆20Updated 5 months ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆19Updated 3 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Updated last year
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆18Updated 2 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆18Updated 3 years ago
- Learning Robust Dynamics Through Variational Sparse Gating☆20Updated 2 years ago
- Generalised UDRL☆37Updated 3 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38Updated 2 years ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆31Updated 2 years ago
- ☆56Updated 3 years ago
- ☆42Updated 3 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 2 months ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Updated 4 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated last year
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆81Updated 3 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆20Updated 11 months ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated 2 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)☆42Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30Updated 3 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆58Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 2 months ago
- ☆35Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year