Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play
☆14May 1, 2018Updated 7 years ago
Alternatives and similar repositories for selfplay
Users that are interested in selfplay are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Memory Augmented Self-Play☆52Oct 26, 2020Updated 5 years ago
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL☆12Aug 4, 2020Updated 5 years ago
- Pixyz Tutorial in RL Architecture Study Group☆11Apr 25, 2019Updated 6 years ago
- The code accompaniment for the CoRL 2020 paper: A User's Guide to Calibrating Robotics Simulators (https://arxiv.org/abs/2011.08985), fro…☆31Nov 20, 2020Updated 5 years ago
- ☆15Aug 12, 2022Updated 3 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 8 years ago
- ☆33Jun 14, 2018Updated 7 years ago
- ☆33Oct 17, 2018Updated 7 years ago
- AI LaBuddyは、大規模言語モデルや深層学習を駆使して、研究室のさまざまなタスクを自動化・効率化するためのツール集です。研究者が行う実験や分析、論文の執筆や資料作成を支援し、Slack上でのコミュニケーションを活性化することができます☆22Jul 2, 2023Updated 2 years ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies☆18Jul 16, 2020Updated 5 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- ☆20Oct 13, 2021Updated 4 years ago
- ☆26Dec 1, 2020Updated 5 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- Code for IROS 2020 paper: https://arxiv.org/abs/1910.04854☆27Aug 30, 2024Updated last year
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]☆25Jan 15, 2022Updated 4 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Mar 27, 2021Updated 4 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- Unofficial Re-implementation of "Dream to Control: Learning Behaviors by Latent Imagination" (https://arxiv.org/abs/1912.01603 ) with PyT…☆32Aug 22, 2020Updated 5 years ago
- A practice of Recursive Feature Elimination (RFE) using Titanic dataset☆10Sep 5, 2021Updated 4 years ago
- ☆33May 31, 2019Updated 6 years ago
- Pythonによる制御工学入門改訂2版☆12Aug 22, 2024Updated last year
- Time-Causal VAE☆19Nov 8, 2024Updated last year
- ☆11Jul 24, 2025Updated 7 months ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- ☆36Dec 8, 2022Updated 3 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 2 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆154Oct 26, 2020Updated 5 years ago
- ☆43Oct 9, 2020Updated 5 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆44Apr 28, 2021Updated 4 years ago
- ☆38Mar 10, 2022Updated 3 years ago
- Approximate Multiparametric Mixed-integer Convex Programming☆15May 16, 2019Updated 6 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- ☆10Jul 29, 2022Updated 3 years ago
- MuJoCo model for Blue☆10Mar 13, 2020Updated 5 years ago
- ☆15Jul 31, 2025Updated 7 months ago