PyTorch implementation of Memory Augmented Self-Play
☆52Oct 26, 2020Updated 5 years ago
Alternatives and similar repositories for memory-augmented-self-play
Users that are interested in memory-augmented-self-play are comparing it to the libraries listed below
Sorting:
- A2C for GVG-AI☆23Nov 7, 2018Updated 7 years ago
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14May 1, 2018Updated 7 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Feb 14, 2018Updated 8 years ago
- Hierarchical Self-Play☆21Dec 5, 2018Updated 7 years ago
- RWA in pytorch☆14May 7, 2017Updated 8 years ago
- Understanding Short-Horizon Bias in Stochastic Meta-Optimization☆37Mar 8, 2018Updated 7 years ago
- ☆17May 30, 2018Updated 7 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- Inferring beliefs about dynamics from behavior☆30May 24, 2018Updated 7 years ago
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL☆12Aug 4, 2020Updated 5 years ago
- ☆85May 29, 2019Updated 6 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆117Dec 13, 2019Updated 6 years ago
- Models built with TensorFlow☆26Dec 5, 2018Updated 7 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- A squad movement planning library for StarCraft AI using Monte Carlo Tree Search and Negamax☆14Jan 1, 2019Updated 7 years ago
- ☆15Sep 5, 2016Updated 9 years ago
- ☆44Dec 4, 2018Updated 7 years ago
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- Unsupervised instance segmentation via active robot interaction☆76Jul 1, 2022Updated 3 years ago
- Code for the paper "Evolved Policy Gradients"☆253Nov 22, 2018Updated 7 years ago
- ☆91Nov 15, 2019Updated 6 years ago
- Publicly releasable baselines for the Retro contest☆129Nov 22, 2018Updated 7 years ago
- Python3 ROS Interface to Rethink Sawyer Robots with OpenAI Gym Compatibility☆62Apr 13, 2019Updated 6 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 8 years ago
- ☆28Oct 9, 2017Updated 8 years ago
- The Variational Homoencoder: Learning to learn high capacity generative models from few examples☆34Jul 13, 2023Updated 2 years ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments☆16Apr 17, 2019Updated 6 years ago
- imperative programming in TensorFlow☆18Dec 12, 2016Updated 9 years ago
- Gated Path Planning Networks (ICML 2018)☆180Jan 23, 2019Updated 7 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 2 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- ☆18Mar 26, 2019Updated 6 years ago
- ICML 2018 Self-Imitation Learning☆278Apr 18, 2020Updated 5 years ago
- Building Agents with Imagination: pytorch step-by-step implementation☆210Feb 22, 2019Updated 7 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 7 years ago
- Learning with latent language☆51Mar 28, 2021Updated 4 years ago
- Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆49Feb 23, 2019Updated 7 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆81Nov 22, 2017Updated 8 years ago