This is a repository for Hidden-utility Self-Play.
☆26Jul 27, 2023Updated 2 years ago
Alternatives and similar repositories for HSP
Users that are interested in HSP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15May 4, 2024Updated last year
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…☆55Nov 22, 2025Updated 5 months ago
- Overcooked human-AI experiment platform☆39Dec 21, 2023Updated 2 years ago
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆47Sep 11, 2024Updated last year
- ☆12Jan 4, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- (AAAI24 oral) Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)☆12May 22, 2023Updated 2 years ago
- PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…☆159Nov 6, 2023Updated 2 years ago
- ☆14Jul 12, 2021Updated 4 years ago
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆46Oct 31, 2024Updated last year
- Collection of RL Environments built using Madrona☆40Aug 11, 2023Updated 2 years ago
- Official implementation of the paper "On the Importance of Environments in Human-Robot Coordination", published in RSS 2021.☆16May 1, 2024Updated 2 years ago
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆20Sep 12, 2025Updated 7 months ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆112Apr 17, 2023Updated 3 years ago
- ☆16Jul 16, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is a Human-like Upper-limb Motion Planner (HUMP) for the generation of arm-hand movements in humanoid robots.☆11Mar 4, 2022Updated 4 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆11Oct 21, 2024Updated last year
- ☆15Jan 16, 2024Updated 2 years ago
- An environment for table-carrying, a joint-action cooperative task.☆10Jan 8, 2024Updated 2 years ago
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning☆32Jun 1, 2023Updated 2 years ago
- Official repository for "Regularization by Texts for Latent Diffusion Inverse Solvers" (ICLR2025 spotlight)☆17Mar 17, 2025Updated last year
- ☆16Apr 6, 2022Updated 4 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- ☆15May 11, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Service Robot Simulator☆11May 3, 2020Updated 5 years ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 7 months ago
- Official Code For EMNLP2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le…☆10Dec 25, 2025Updated 4 months ago
- [AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".☆21Jul 26, 2025Updated 9 months ago
- ☆17Jun 25, 2025Updated 10 months ago
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆25Mar 10, 2026Updated last month
- An NLP research and data collection platform.☆17Mar 13, 2024Updated 2 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- This is a project for creating and using IL datasets based on HuggingFace weights with multithreads for performance, and benchmarking☆13Mar 10, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A benchmark environment for fully cooperative human-AI performance.☆967Mar 22, 2025Updated last year
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆27Mar 19, 2026Updated last month
- ☆20Oct 9, 2024Updated last year
- ZOOT Plus 数 据定期备份☆18Updated this week
- suPER is a collaborative multi-agent RL algorithm☆14Jun 11, 2024Updated last year
- League of Legends, Teamfight Tactics Environment for RL(not complete)☆13Jun 20, 2020Updated 5 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆19May 10, 2024Updated last year