LinZichuan/AdMRL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LinZichuan/AdMRL)

LinZichuan / AdMRL

Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)

☆35

Alternatives and similar repositories for AdMRL

Users that are interested in AdMRL are comparing it to the libraries listed below

Sorting:

thomyphan / resilient-marl
View on GitHub
Resilient Multi-Agent Reinforcement Learning
☆10Nov 4, 2022Updated 3 years ago
keynans / HypeRL
View on GitHub
Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)
☆26Jun 9, 2021Updated 4 years ago
PhilippeMorere / EMU-Q
View on GitHub
Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.
☆10Nov 8, 2018Updated 7 years ago
rythei / DARLA-PyTorch
View on GitHub
PyTorch implementation of DARLA preprocessing models
☆11Jan 30, 2018Updated 8 years ago
ankitkv / TD-VAE
View on GitHub
TD-VAE in PyTorch
☆10May 28, 2019Updated 6 years ago
laurimi / multiagent-prediction-reward
View on GitHub
Multi-agent active perception with prediction rewards
☆11Nov 13, 2020Updated 5 years ago
zachary2wave / DeepLearning-500-questions
View on GitHub
深度学习500问，以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述，以帮助自己及有需要的读者。全书分为18个章节，50余万字。由于水平有限，书中不妥之处恳请广大读者批评指正。未完待续............ 如有意合作，联系sc…
☆12Jul 5, 2019Updated 6 years ago
rraileanu / policy-dynamics-value-functions
View on GitHub
☆33Aug 30, 2024Updated last year
RuohanW / RED
View on GitHub
Implementation of Random Expert Distillation
☆29May 11, 2019Updated 6 years ago
rll-research / finetune-vs-metarl
View on GitHub
☆14May 31, 2022Updated 3 years ago
LinZichuan / emdqn
View on GitHub
Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018
☆62Sep 5, 2018Updated 7 years ago
Hwhitetooth / lirpg
View on GitHub
☆62Jun 22, 2018Updated 7 years ago
psclklnk / spdl
View on GitHub
Source code for the Self-Paced Deep Reinforcement Learning Experiments
☆32Mar 24, 2023Updated 2 years ago
suyoung-lee / Episodic-Backward-Update
View on GitHub
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆16Sep 24, 2019Updated 6 years ago
lns / dapo
View on GitHub
Source code for the paper "Divergence-Augmented Policy Optimization"
☆37Nov 28, 2019Updated 6 years ago
gkahn13 / gcg-old
View on GitHub
a library for deep reinforcement learning, with applications for navigation
☆16Feb 6, 2018Updated 8 years ago
NagisaZj / MetaCURE-Public
View on GitHub
☆15Apr 5, 2023Updated 2 years ago
iclavera / learning_to_adapt
View on GitHub
Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning
☆218Dec 27, 2022Updated 3 years ago
RockySJ / ampo
View on GitHub
☆15Oct 20, 2020Updated 5 years ago
StoneT2000 / trajectorytranslation
View on GitHub
Code for Abstract-to-Executable Trajectory Translation for One Shot Task Generalization (ICML 2023)
☆23May 12, 2023Updated 2 years ago
DavidJanz / successor_uncertainties_atari
View on GitHub
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21Feb 24, 2023Updated 3 years ago
dennisl88 / rand_param_envs
View on GitHub
Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7
☆20Feb 14, 2019Updated 7 years ago
wensun / Imitation-Learning-from-Observation
View on GitHub
☆24Jul 6, 2023Updated 2 years ago
nnaisense / MAX
View on GitHub
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆81Jul 23, 2019Updated 6 years ago
Stilwell-Git / Randomized-Return-Decomposition
View on GitHub
TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"
☆19Mar 17, 2022Updated 3 years ago
lns / memoire
View on GitHub
☆18Apr 17, 2019Updated 6 years ago
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
atavakol / action-hypergraph-networks
View on GitHub
(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
☆23Jun 22, 2021Updated 4 years ago
micahcarroll / uniMASK
View on GitHub
Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"
☆57Jul 3, 2024Updated last year
yidingjiang / love
View on GitHub
Code for the paper "Learning Options via Compression" at NeurIPS 2022
☆25Jan 11, 2023Updated 3 years ago
Div-Infinity / LISA
View on GitHub
(NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation
☆29Feb 22, 2023Updated 3 years ago
facebookresearch / deep_bisim4control
View on GitHub
Learning Invariant Representations for Reinforcement Learning without Reconstruction
☆156Aug 31, 2021Updated 4 years ago
StanfordVL / ac-teach
View on GitHub
Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers
☆24Feb 15, 2023Updated 3 years ago
c-swm / c-swm
View on GitHub
This repository has moved to: https://github.com/tkipf/c-swm
☆27Jan 5, 2020Updated 6 years ago
HumanCompatibleAI / population-irl
View on GitHub
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆28Jun 20, 2019Updated 6 years ago
apexrl / bmpo
View on GitHub
Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>
☆23Mar 24, 2023Updated 2 years ago
JasonMa2016 / SMODICE
View on GitHub
Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…
☆28Jan 12, 2023Updated 3 years ago
ksluck / Coadaptation
View on GitHub
Repository replicating the design- and behaviour-adaptation algorithm using reinforcement learning algorithm presented in the paper " Dat…
☆27Jul 20, 2022Updated 3 years ago
haosulab / RPG
View on GitHub
Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization
☆27Jul 19, 2023Updated 2 years ago