Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play
☆14May 1, 2018Updated 7 years ago
Alternatives and similar repositories for selfplay
Users that are interested in selfplay are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Memory Augmented Self-Play☆52Oct 26, 2020Updated 5 years ago
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL☆12Aug 4, 2020Updated 5 years ago
- Pixyz Tutorial in RL Architecture Study Group☆11Apr 25, 2019Updated 6 years ago
- ☆33Oct 17, 2018Updated 7 years ago
- The code accompaniment for the CoRL 2020 paper: A User's Guide to Calibrating Robotics Simulators (https://arxiv.org/abs/2011.08985), fro…☆31Nov 20, 2020Updated 5 years ago
- Hierarchical Self-Play☆21Dec 5, 2018Updated 7 years ago
- ☆15Aug 12, 2022Updated 3 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- ☆33Jun 14, 2018Updated 7 years ago
- AI LaBuddyは、大規模言語モデルや深層学習を駆使して、研究室のさまざまなタスクを自動化・効率化するためのツール集です。研究者が行う実験や分析、論文の執筆や資料作成を支援し、Slack上でのコミュニケーションを活性化することができます☆22Jul 2, 2023Updated 2 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- ☆26Dec 1, 2020Updated 5 years ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- ☆19Oct 13, 2021Updated 4 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 9 years ago
- ☆28Oct 9, 2017Updated 8 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆22Apr 28, 2021Updated 4 years ago
- ☆10Dec 9, 2021Updated 4 years ago
- MADDPG agent with collaboration and competition☆12Nov 9, 2018Updated 7 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Mar 27, 2021Updated 4 years ago
- MVE: model-based value estimation☆11Jul 30, 2018Updated 7 years ago
- ☆15Jul 31, 2025Updated 7 months ago
- ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies☆18Jul 16, 2020Updated 5 years ago
- ☆38Mar 10, 2022Updated 4 years ago
- A standalone library to randomize various OpenAI Gym Environments☆66Sep 29, 2019Updated 6 years ago
- ☆16Oct 3, 2022Updated 3 years ago
- Code for ICRA2021 "Policy Transfer via Kinematic Domain Randomization and Adaptation"☆12Apr 28, 2021Updated 4 years ago
- (NeurIPS 2018) Hardware Conditioned Policies for Multi-Robot Transfer Learning☆20Apr 8, 2019Updated 6 years ago
- Unofficial Re-implementation of "Dream to Control: Learning Behaviors by Latent Imagination" (https://arxiv.org/abs/1912.01603 ) with PyT…☆32Aug 22, 2020Updated 5 years ago
- Deep PILCO PyTorch Implementation☆15Mar 25, 2023Updated 2 years ago
- Learning from Trajectories via Subgoal Discovery☆12Dec 10, 2020Updated 5 years ago
- A deep reinforcement learning multi-agent algorithm, where a team learns to complete a task and communicate between agents.☆16Jun 1, 2021Updated 4 years ago
- DeepNC: Deep Generative Network Completion☆10Dec 1, 2020Updated 5 years ago
- Deep Reinforcement Learning for Multi Agent Soccer☆16Dec 15, 2016Updated 9 years ago
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- ☆54Feb 19, 2018Updated 8 years ago
- Official Repository for Westlake Deep Learning Course (2024)☆14Jun 6, 2024Updated last year