IandRover / meta_gradient_RLLinks

Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"

☆20

Alternatives and similar repositories for meta_gradient_RL

Users that are interested in meta_gradient_RL are comparing it to the libraries listed below

Sorting:

amazon-science / meta-q-learning
Code for the paper "Meta-Q-Learning"( ICLR 2020)
☆103Updated 3 years ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆94Updated 3 weeks ago
j3soon / dfac
[ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning
☆32Updated 2 years ago
twni2016 / Meta-SAC
Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
☆32Updated 3 years ago
MadryLab / implementation-matters
☆132Updated 11 months ago
BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆123Updated 4 years ago
Howuhh / prioritized_experience_replay
Prioritized Experience Replay implementation with proportional prioritization
☆81Updated last year
luckeciano / transformers-metarl
Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022
☆63Updated 2 years ago
nikhil3456 / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…
☆71Updated 5 years ago
wisnunugroho21 / reinforcement_learning_ppo_rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…
☆53Updated 4 years ago
dragen1860 / MAML-Pytorch-RL
☆31Updated 2 years ago
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81Updated 2 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago
facebookresearch / CollaQ
A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"
☆130Updated last year
csmile-1006 / PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆163Updated last year
garrett4wade / revisiting_marl
Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)
☆22Updated 3 years ago
NeuralMMO / baselines
Baselines for Neural MMO -- new users should treat this repo as a starter project
☆47Updated 11 months ago
yeshenpy / ERL-Re2
This is the official implementation of ERL-Re2.
☆64Updated last year
BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆89Updated 2 years ago
kevslinger / DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆163Updated last year
felix-kerkhoff / DQfD
An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games
☆29Updated 2 years ago
semitable / lb-foraging
Level-based Foraging (LBF): A multi-agent environment for RL
☆183Updated 10 months ago
crisbodnar / pderl
Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020
☆52Updated 11 months ago
koulanurag / minimal-marl
Minimal implementation of multi-agent reinforcement learning algorithms
☆56Updated 3 years ago
jesbu1 / hidio
Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options
☆45Updated 3 years ago
pokaxpoka / sunrise
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
☆125Updated 4 years ago
theevann / MinimaxQ-Learning
Applying minimaxQ learning algorithm to 2 agents games
☆33Updated 7 years ago
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆179Updated 3 years ago
ReinholdM / Offline-Pre-trained-Multi-Agent-Decision-Transformer
☆111Updated 2 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆87Updated 4 years ago