tesatory/selfplay

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tesatory/selfplay)

tesatory / selfplay

Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play

☆14

Alternatives and similar repositories for selfplay

Users that are interested in selfplay are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shagunsodhani / memory-augmented-self-play
View on GitHub
PyTorch implementation of Memory Augmented Self-Play
☆52Oct 26, 2020Updated 5 years ago
montrealrobotics / unsupervised-adr
View on GitHub
Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL
☆12Aug 4, 2020Updated 5 years ago
TMats / rlarch-pixyz-tutorial
View on GitHub
Pixyz Tutorial in RL Architecture Study Group
☆11Apr 25, 2019Updated 7 years ago
gkahn13 / CAPs
View on GitHub
☆33Oct 17, 2018Updated 7 years ago
NVlabs / sim-parameter-estimation
View on GitHub
The code accompaniment for the CoRL 2020 paper: A User's Guide to Calibrating Robotics Simulators (https://arxiv.org/abs/2011.08985), fro…
☆30Nov 20, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
matsuolab / virtual_desktop_docker
View on GitHub
A minimal toolset for running UI applications within docker isolated X11 environment
☆16Jan 11, 2026Updated 6 months ago
sii-yingwen / rommeo
View on GitHub
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23Dec 8, 2022Updated 3 years ago
tesatory / hsp
View on GitHub
Hierarchical Self-Play
☆21Dec 5, 2018Updated 7 years ago
k1000dai / AlohaScorpion
View on GitHub
☆17Feb 18, 2026Updated 5 months ago
philipjball / ReadyPolicyOne
View on GitHub
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18Jul 6, 2023Updated 3 years ago
kenoharada / AI-LaBuddy
View on GitHub
AI LaBuddyは、大規模言語モデルや深層学習を駆使して、研究室のさまざまなタスクを自動化・効率化するためのツール集です。研究者が行う実験や分析、論文の執筆や資料作成を支援し、Slack上でのコミュニケーションを活性化することができます
☆22Jul 2, 2023Updated 3 years ago
eugval / sim2real_dynamics_simulation
View on GitHub
☆26Dec 1, 2020Updated 5 years ago
robot-learning-freiburg / CEILing
View on GitHub
☆19Oct 13, 2021Updated 4 years ago
airoa-org / rebake-legacy
View on GitHub
🍞 Bake your robot data into ML-ready formats
☆19Feb 25, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
djstrouse / InfoMARL
View on GitHub
using information theory to encourage agents to cooperate and compete
☆19Oct 4, 2018Updated 7 years ago
abhishm / PGQ
View on GitHub
PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.
☆15Mar 9, 2017Updated 9 years ago
Feryal / a3c-mujoco
View on GitHub
☆28Oct 9, 2017Updated 8 years ago
Wenxuan-Zhou / EPI
View on GitHub
Code for Environment Probing Interaction Policies [ICLR 2019]
☆30Jun 17, 2019Updated 7 years ago
htdt / lwm
View on GitHub
Latent World Models For Intrinsically Motivated Exploration | Official repository
☆23Apr 28, 2021Updated 5 years ago
sujoyp / subgoal-discovery
View on GitHub
Learning from Trajectories via Subgoal Discovery
☆12Dec 10, 2020Updated 5 years ago
Redrew / CAP
View on GitHub
☆10Dec 9, 2021Updated 4 years ago
Toyota / SGInit-VO
View on GitHub
☆16Jul 31, 2025Updated 11 months ago
tianbingsz / WALL-E
View on GitHub
Codebase for Efficient yet simple Reinforcement Learning Research Framework
☆28Jan 14, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
iexarchos / PolicyTransferKinDRA
View on GitHub
Code for ICRA2021 "Policy Transfer via Kinematic Domain Randomization and Adaptation"
☆12Apr 28, 2021Updated 5 years ago
danielnbarbosa / soccer_twos
View on GitHub
MADDPG agent with collaboration and competition
☆12Nov 9, 2018Updated 7 years ago
jachiam / surprise
View on GitHub
Surprise-based intrinsic motivation for deep reinforcement learning
☆21Mar 6, 2017Updated 9 years ago
BrunoKM / deep-pilco-torch
View on GitHub
Deep PILCO PyTorch Implementation
☆15Mar 25, 2023Updated 3 years ago
eugenevinitsky / robust_RL_multi_adversary
View on GitHub
We investigate the effect of populations on finding good solutions to the robust MDP
☆29Mar 27, 2021Updated 5 years ago
vlad17 / mve
View on GitHub
MVE: model-based value estimation
☆11Jul 30, 2018Updated 7 years ago
frt03 / mxt_bench
View on GitHub
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)
☆14Feb 3, 2023Updated 3 years ago
airoa-org / yubi-sw
View on GitHub
☆30Jul 8, 2026Updated last week
montrealrobotics / domain-randomizer
View on GitHub
A standalone library to randomize various OpenAI Gym Environments
☆66Sep 29, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
srsohn / msgi
View on GitHub
ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies
☆18Jul 16, 2020Updated 6 years ago
POSTECH-CVLab / style-agnostic-RL
View on GitHub
☆16Oct 3, 2022Updated 3 years ago
taochenshh / hcp
View on GitHub
(NeurIPS 2018) Hardware Conditioned Policies for Multi-Robot Transfer Learning
☆20Apr 8, 2019Updated 7 years ago
cross32768 / Dreamer_PyTorch
View on GitHub
Unofficial Re-implementation of "Dream to Control: Learning Behaviors by Latent Imagination" (https://arxiv.org/abs/1912.01603 ) with PyT…
☆32Aug 22, 2020Updated 5 years ago
david-simoes-93 / A3C3
View on GitHub
A deep reinforcement learning multi-agent algorithm, where a team learns to complete a task and communicate between agents.
☆16Jun 1, 2021Updated 5 years ago
gkahn13 / LaND
View on GitHub
☆43Oct 9, 2020Updated 5 years ago
zuoxingdong / DeepPILCO
View on GitHub
☆54Feb 19, 2018Updated 8 years ago