RobertTLange / minimal-meta-rl

Minimal A2C/A3C example of an LSTM-based meta-learner.

☆13

Alternatives and similar repositories for minimal-meta-rl

Users who are interested in minimal-meta-rl are comparing it to the libraries listed below.

BY571 / Munchausen-RL
PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN
☆45 Updated 4 years ago
distillpub / post--understanding-rl-vision
Understanding RL vision Distill article
☆23 Updated 2 years ago
younggyoseo / RE3
RE3: State Entropy Maximization with Random Encoders for Efficient Exploration
☆69 Updated 3 years ago
vwxyzjn / a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
☆22 Updated 3 years ago
ingambe / RayEnvWrapper
Wrapper that vectorizes OpenAI Gym environments with Ray
☆23 Updated 2 years ago
RobertTLange / gymnax-blines
Baselines for gymnax 🤖
☆67 Updated 2 years ago
yfletberliac / adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
☆48 Updated 3 years ago
soumik12345 / DDPG
PyTorch implementation of Deep Deterministic Policy Gradients for Continuous Control
☆26 Updated 2 years ago
vwxyzjn / gym-pysc2
Gym wrapper for pysc2
☆10 Updated 2 years ago
ucl-dark / pax
Scalable Opponent Shaping Experiments in JAX
☆24 Updated last year
philipjball / SAC_PyTorch
🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation
☆38 Updated 3 years ago
MarcoMeter / neroRL
Deep Reinforcement Learning Framework done with PyTorch
☆37 Updated 4 months ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆39 Updated last week
ElisevanderPol / PRAE
Plannable Approximations to MDP Homomorphisms: Equivariance under Actions
☆30 Updated 5 years ago
lili-chen / SEER
Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.
☆21 Updated 4 years ago
bmazoure / ppo_jax
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆57 Updated 2 years ago
schmidtdominik / Rainbow
Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …
☆45 Updated 3 years ago
gkswamy98 / fast_irl
Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.
☆51 Updated 2 years ago
facebookresearch / ssorl
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
☆42 Updated 2 years ago
BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆89 Updated 2 years ago
toshikwa / rljax
A collection of RL algorithms written in JAX.
☆101 Updated 3 years ago
google-deepmind / zipfian_environments
☆28 Updated 2 years ago
alec-tschantz / planet
PlaNet: Learning Latent Dynamics for Planning from Pixels
☆10 Updated 5 years ago
joelouismarino / variational_rl
Variational Reinforcement Learning
☆16 Updated last year
brentyi / minGPT-flax
GPT implementation in Flax
☆18 Updated 3 years ago
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆73 Updated 11 months ago
jannerm / gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
☆44 Updated last year
luchris429 / discovered-policy-optimisation
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆11 Updated 2 years ago
crisbodnar / pderl
Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020
☆52 Updated last year
RedTachyon / coltra-rl
A modular implementation of PPO, and soon hopefully other algorithms.
☆26 Updated last year