faameunier / MCTSnet
A PyTorch implementation of DeepMind's MCTSnet
โ18Updated 2 years ago
Alternatives and similar repositories for MCTSnet:
Users that are interested in MCTSnet are comparing it to the libraries listed below
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimationโ40Updated 6 months ago
- ๐งถ Minimal PyTorch Soft Actor Critic (SAC) implementationโ38Updated 3 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.โ33Updated 2 years ago
- Implicit Normalizing Flows + Reinforcement Learningโ61Updated 5 years ago
- โ17Updated 4 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisationโ14Updated 4 years ago
- My Body Is A Cageโ40Updated 4 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".โ24Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimizationโ31Updated 3 years ago
- Public Release of Plan2vec Implementation in pyTorchโ56Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradientsโ32Updated 5 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"โ19Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according โฆโ35Updated 11 months ago
- โ29Updated 4 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)โ20Updated 3 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"โ43Updated 3 years ago
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learningโ33Updated 4 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]โ38Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsโฆโ54Updated 2 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"โ44Updated 2 years ago
- On the model-based stochastic value gradient for continuous reinforcement learningโ55Updated last year
- โ32Updated 9 months ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"โ44Updated last year
- General Modules for JAXโ64Updated last month
- A simple and easy to use implementation of the soft actor-critic algorithm.โ15Updated 2 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024โ23Updated last year
- Deep Reinforcement Learning Framework done with PyTorchโ36Updated last month
- โ24Updated 9 months ago
- โ42Updated 2 years ago
- An implementation of MuZero in JAX.โ56Updated 2 years ago