Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play
☆14May 1, 2018Updated 8 years ago
Alternatives and similar repositories for selfplay
Users that are interested in selfplay are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of Memory Augmented Self-Play☆52Oct 26, 2020Updated 5 years ago
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL☆12Aug 4, 2020Updated 5 years ago
- Pixyz Tutorial in RL Architecture Study Group☆11Apr 25, 2019Updated 7 years ago
- ☆33Oct 17, 2018Updated 7 years ago
- The code accompaniment for the CoRL 2020 paper: A User's Guide to Calibrating Robotics Simulators (https://arxiv.org/abs/2011.08985), fro…☆30Nov 20, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Aug 12, 2022Updated 3 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- ☆33Jun 14, 2018Updated 7 years ago
- AI LaBuddyは、大規模言語モデルや深層学習を駆使して、研究室のさまざまなタスクを自動化・効率化するためのツール集です。研究者が行う実験や分析、論文の執筆や資料作成を支援し、Slack上でのコミュニケーションを活性化することができます☆22Jul 2, 2023Updated 2 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- ☆26Dec 1, 2020Updated 5 years ago
- ☆19Oct 13, 2021Updated 4 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 9 years ago
- ☆28Oct 9, 2017Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for Environment Probing Interaction Policies [ICLR 2019]☆30Jun 17, 2019Updated 6 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆23Apr 28, 2021Updated 5 years ago
- [TUD Thesis] Isaac Gym Envs with Drone Racing Tasks☆14Feb 23, 2025Updated last year
- MADDPG agent with collaboration and competition☆12Nov 9, 2018Updated 7 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆29Mar 27, 2021Updated 5 years ago
- MVE: model-based value estimation☆11Jul 30, 2018Updated 7 years ago
- ☆16Jul 31, 2025Updated 10 months ago
- ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies☆18Jul 16, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆38Mar 10, 2022Updated 4 years ago
- A standalone library to randomize various OpenAI Gym Environments☆66Sep 29, 2019Updated 6 years ago
- ☆16Oct 3, 2022Updated 3 years ago
- Code for ICRA2021 "Policy Transfer via Kinematic Domain Randomization and Adaptation"☆12Apr 28, 2021Updated 5 years ago
- Deep PILCO PyTorch Implementation☆15Mar 25, 2023Updated 3 years ago
- Unofficial Re-implementation of "Dream to Control: Learning Behaviors by Latent Imagination" (https://arxiv.org/abs/1912.01603 ) with PyT…☆32Aug 22, 2020Updated 5 years ago
- (NeurIPS 2018) Hardware Conditioned Policies for Multi-Robot Transfer Learning☆20Apr 8, 2019Updated 7 years ago
- Learning from Trajectories via Subgoal Discovery☆12Dec 10, 2020Updated 5 years ago
- A deep reinforcement learning multi-agent algorithm, where a team learns to complete a task and communicate between agents.☆16Jun 1, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- DeepNC: Deep Generative Network Completion☆10Dec 1, 2020Updated 5 years ago
- ☆43Oct 9, 2020Updated 5 years ago
- Deep Reinforcement Learning for Multi Agent Soccer☆16Dec 15, 2016Updated 9 years ago
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- ☆54Feb 19, 2018Updated 8 years ago
- ☆34May 31, 2019Updated 7 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"☆19Oct 26, 2018Updated 7 years ago