RobertTLange / minimal-meta-rlLinks
Minimal A2C/A3C example of an LSTM-based meta-learner.
โ13Updated 4 years ago
Alternatives and similar repositories for minimal-meta-rl
Users that are interested in minimal-meta-rl are comparing it to the libraries listed below
Sorting:
- ๐งถ Minimal PyTorch Soft Actor Critic (SAC) implementationโ38Updated 3 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"โ38Updated 4 years ago
- OpenAi's gym environment wrapper to vectorize them with Rayโ22Updated 2 years ago
- Implementation of Proximal Policy Optimization in Jax+Flaxโ19Updated 2 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithmsโ20Updated 5 months ago
- Curiosity-driven Exploration by Self-supervised Predictionโ21Updated 5 years ago
- Code base for NeurIPS 2022 paper Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation.โ11Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimationโ40Updated 7 months ago
- โ11Updated 4 years ago
- Docker containers of baseline agents for the Crafter environmentโ28Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. aโฆโ21Updated 4 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]โ39Updated 2 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Explorationโ68Updated 3 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RLโ29Updated last year
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)โ27Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorchโ36Updated 2 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"โ100Updated 3 years ago
- Baselines for gymnax ๐คโ66Updated 2 years ago
- Gym wrapper for pysc2โ10Updated 2 years ago
- Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"โ19Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)โ10Updated last year
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238โ47Updated 4 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)โ25Updated 3 years ago
- [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flowโ34Updated 7 months ago
- Scalable Opponent Shaping Experiments in JAXโ24Updated last year
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020โ31Updated 3 years ago
- โ36Updated 2 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.โ33Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimationโ26Updated last year
- A modular implementation of PPO, and soon hopefully other algorithms.โ26Updated last year