Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)
☆35Mar 6, 2021Updated 4 years ago
Alternatives and similar repositories for AdMRL
Users that are interested in AdMRL are comparing it to the libraries listed below
Sorting:
- Resilient Multi-Agent Reinforcement Learning☆10Nov 4, 2022Updated 3 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆26Jun 9, 2021Updated 4 years ago
- Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.☆10Nov 8, 2018Updated 7 years ago
- PyTorch implementation of DARLA preprocessing models☆11Jan 30, 2018Updated 8 years ago
- TD-VAE in PyTorch☆10May 28, 2019Updated 6 years ago
- Multi-agent active perception with prediction rewards☆11Nov 13, 2020Updated 5 years ago
- 深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系sc…☆12Jul 5, 2019Updated 6 years ago
- ☆33Aug 30, 2024Updated last year
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 6 years ago
- ☆14May 31, 2022Updated 3 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Sep 5, 2018Updated 7 years ago
- ☆62Jun 22, 2018Updated 7 years ago
- Source code for the Self-Paced Deep Reinforcement Learning Experiments☆32Mar 24, 2023Updated 2 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago
- a library for deep reinforcement learning, with applications for navigation☆16Feb 6, 2018Updated 8 years ago
- ☆15Apr 5, 2023Updated 2 years ago
- Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning☆218Dec 27, 2022Updated 3 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- Code for Abstract-to-Executable Trajectory Translation for One Shot Task Generalization (ICML 2023)☆23May 12, 2023Updated 2 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- ☆24Jul 6, 2023Updated 2 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆81Jul 23, 2019Updated 6 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 3 years ago
- ☆18Apr 17, 2019Updated 6 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆57Jul 3, 2024Updated last year
- Code for the paper "Learning Options via Compression" at NeurIPS 2022☆25Jan 11, 2023Updated 3 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Feb 22, 2023Updated 3 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆156Aug 31, 2021Updated 4 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Feb 15, 2023Updated 3 years ago
- This repository has moved to: https://github.com/tkipf/c-swm☆27Jan 5, 2020Updated 6 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Jun 20, 2019Updated 6 years ago
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>☆23Mar 24, 2023Updated 2 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆28Jan 12, 2023Updated 3 years ago
- Repository replicating the design- and behaviour-adaptation algorithm using reinforcement learning algorithm presented in the paper " Dat…☆27Jul 20, 2022Updated 3 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆27Jul 19, 2023Updated 2 years ago